Site Reliability Engineer
1 month ago
Research and resolve technical issues found in production with immediate and long-term solutions. Conveys problems, solutions, updates and project status to peers and management.
With limited direction, develop code, test, debug, maintain, modify, and install moderately complex software programs. Use knowledge of available utilities to simplify tasks. Apply structured programming techniques and adhere to coding standards. Collaborate with development and QA to ensure that features and functions are not adversely impacted.
Document final solutions, including root cause analysis.
Maintain a patch code branch for each fix. Identify which need to be product changes and which are infrastructure, deployment or build changes.
Advocate for your changes with follow-through to development and other teams by ensuring that patches are tested and migrated to the code base and/or process changes are documented appropriately and timely.
Improve process and tools to automate deployments and other operational tasks (i.e. monitoring application components) to improve efficiency and consistency. use your knowledge to create scripts, tools, utilities and/or code to address issues with new projects and to resolve existing operational procedures (i.e. single-click deployments).