Site Reliability Engineer

Local Jobs JPMorgan Chase Bank, N.A.
  • United States, Plano, TX
  • Post Date : October 23, 2020
  • Apply Before : November 22, 2020

Job Description

As a Site Reliability Engineer (SRE), you’ll help build a meaningful engineering discipline, combining software and systems to develop creative engineering solutions to operations problems. Much of our support and software development focuses on optimizing existing systems, building infrastructure, and reducing work through automation. You’ll join a team of curious problem solvers with a diverse set of perspectives who are thinking big and taking risks. In this environment, you’ll take the lead on relevant projects, supported by an organization that provides the support and mentorship you need to learn and grow. As an SRE, you’ll be focused on running better production applications and systems.


Responsibilities:

  • Design, code, test, and deliver software to automate manual operational work
  • Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure of incidents
  • Engage with development team throughout the life cycle to help develop software for reliability and scale, ensuring minimal refactoring or changes
  • Identify application patterns and analytics in support of better service level objectives
  • Design self-healing and resiliency patterns
  • Design automated software and product upgrades, change management, and release management solutions
  • Coach or manage teams as applicable
  • Participate in the 24×7 support coverage as needed

Qualifications:

  • Bachelor’s degree or equivalent experience in a software engineering discipline
  • Expertise in at least one technology stack designing, coding, testing, and delivering software
  • Proficiency in one or more technology domains; may be a cross-domain expert able to solve complex, mission-critical problems within a business or across the firm
  • Working knowledge of infrastructure components (e.g. routers, load balancers, cloud products, container systems, compute, storage, and networks)
  • Excellent debugging and troubleshooting skills

JPMorgan Chase & Co., one of the oldest financial institutions, offers innovative financial solutions to millions of consumers, small businesses and many of the world’s most prominent corporate, institutional and government clients under the J.P. Morgan and Chase brands. Our history spans over 200 years and today we are a leader in investment banking, consumer and small business banking, commercial banking, financial transaction processing and asset management.

We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. In accordance with applicable law, we make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as any mental health or physical disability needs.

Equal Opportunity Employer/Disability/Veterans

Other jobs you may like

Site Reliability Engineer

Local Jobs JPMorgan Chase Bank, N.A.
  • United States, Chicago, IL
  • Post Date : October 23, 2020
  • Apply Before : November 22, 2020

Job Description

As a Site Reliability Engineer (SRE), you’ll help build a meaningful engineering discipline, combining software and systems to develop creative engineering solutions to operations problems. Much of our support and software development focuses on optimizing existing systems, building infrastructure, and reducing work through automation. You’ll join a team of curious problem solvers with a diverse set of perspectives who are thinking big and taking risks. In this environment, you’ll take the lead on relevant projects, supported by an organization that provides the support and mentorship you need to learn and grow. As an SRE, you’ll be focused on running better production applications and systems.


Responsibilities:

  • Design, code, test, and deliver software to automate manual operational work
  • Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure of incidents
  • Engage with development team throughout the life cycle to help develop software for reliability and scale, ensuring minimal refactoring or changes
  • Identify application patterns and analytics in support of better service level objectives
  • Design self-healing and resiliency patterns
  • Design automated software and product upgrades, change management, and release management solutions
  • Coach or manage teams as applicable
  • Participate in the 24×7 support coverage as needed

Qualifications:

  • Bachelor’s degree or equivalent experience in a software engineering discipline
  • Expertise in at least one technology stack designing, coding, testing, and delivering software
  • Proficiency in one or more technology domains; may be a cross-domain expert able to solve complex, mission-critical problems within a business or across the firm
  • Working knowledge of infrastructure components (e.g. routers, load balancers, cloud products, container systems, compute, storage, and networks)
  • Excellent debugging and troubleshooting skills

JPMorgan Chase & Co., one of the oldest financial institutions, offers innovative financial solutions to millions of consumers, small businesses and many of the world’s most prominent corporate, institutional and government clients under the J.P. Morgan and Chase brands. Our history spans over 200 years and today we are a leader in investment banking, consumer and small business banking, commercial banking, financial transaction processing and asset management.

We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. In accordance with applicable law, we make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as any mental health or physical disability needs.

Equal Opportunity Employer/Disability/Veterans

Other jobs you may like

Site Reliability Engineer

Local Jobs Microsoft in Software Development
  • United States, New York, NY
  • Post Date : October 16, 2020
  • Apply Before : November 15, 2020

Job Description

Site Reliability Engineer, PromoteIQ

Microsoft Advertising

PromoteIQ provides intelligent vendor marketing solutions for the next generation of e-commerce. Our platform helps retailers implement, automate, and scale their brand-funded digital vendor marketing programs. We sit at the intersection of marketing and e-commerce and have a singular mission of empowering retailers and brands to maximize their e-commerce performance.

PromoteIQ embodies a strong startup culture that values diversity, collaboration and craftsmanship – and above all else, results. Our bias towards execution balances critical thinking, root cause analysis and pragmatic problem solving. We expect a lot from one another and value our thoughtful and intellectually curious company culture.

PromoteIQ is headquartered in New York City and supports a global footprint of e-commerce retailers and brands. The company was acquired by Microsoft in August 2019 and continues to operate as an independent division within Microsoft Advertising. Learn more at https://www.promoteiq.com. This role is based in our SoHo/NYC office.

Microsoft Advertising is a worldwide Sales, Marketing and Services organization on the cutting edge of the digital advertising industry. Microsoft Advertising offers a compelling portfolio of advertising products, innovative solutions and the opportunity to engage with some of the brightest minds in the digital industry. Microsoft Advertising is the destination for experienced, collaborative, and passionate digital advertising professionals seeking a rewarding career and lifestyle.


Who We’re Looking For

At PromoteIQ, DevOps Engineers specialize in developing scalable methods for building, deploying, and supporting our cloud-agnostic enterprise services and systems. This is a highly collaborative role in which you will work closely with our Software Engineers to deploy and operate our solutions, automate and streamline our processes, build and maintain deployment tools, monitor IT operations, and troubleshoot and resolve issues in our dev, test, and production environments.


Responsibilities

  • Design and build infrastructure and systems that provide high levels of scalability, reliability, and performance for PromoteIQ’s stack, while balancing security, maintainability, and operational excellence
  • Interface across teams to codify and reliably test infrastructure changes using PromoteIQ’s software development lifecycle
  • Partner with Product and Dev teams to provide guidance and best practices around scalability, reliability, and performance of our production systems, infrastructure, and software
  • Work as a team on escalations, resolving critical issues that impact our highly available dev, test, and production systems
  • Work with a creative engineering team to continuously implement and improve reliable and speedy build environments for DEV & QA; provide timely build status updates; automate as much as possible to improve efficiency and quality
  • Promote innovation, implementation of cutting-edge technologies, outside-of-the-box thinking, teamwork, and self-organization
  • Work with GitHub Actions or other build tools in a CI/CD process to build and deploy to our cloud-agnostic environment
  • Ensure traceability, observability, and retrievability of sources and deliverables
  • Build logging, monitoring, and alerting systems to identify bottlenecks and assist with debugging, analysis, and optimization in a cloud-agnostic environment
  • Improve operational efficiency through automation and deployment or development of new tools
  • Experiment with and recommend new technologies that simplify or improve PromoteIQ’s stack
  • Craft solid and clearly explained designs, playbooks, and documentation, for consumption by teammates and the larger engineering organization
  • Participate in an off-hours on-call rotation, and perform periodic off-hours work during maintenance windows


Qualifications


Required Qualifications
  • 1+ years of experience in cloud SRE/infrastructure or a related field
  • 3+ years of experience as an Engineer on a Software Engineering team
  • Experience with cloud-agnostic configuration management frameworks (Ansible, Terraform, etc)
  • Experience configuring and managing cloud infrastructure (AWS, GCP, Azure)
  • Understanding of SSH, VPN, TCP/IP, DNS, HTTP(S), network routing and subnetting
  • Experience with CI/CD pipelines such as Jenkins, Travis, Azure DevOps, TeamCity, etc.
  • System observability experience (Zabbix, CloudWatch, PagerDuty, Datadog, Azure Monitor, SignalFx, Grafana, etc.)
  • Knowledge of Linux (Debian/Ubuntu) architecture, security, administration, performance monitoring/tuning, troubleshooting, and production operations
  • Fluent in Python and Shell Scripting, with experience implementing automation and monitoring using shell scripting and other related tools
  • Experience with containerization technologies (Docker, Kubernetes, etc)


Preferred Qualifications

  • Experience with managing and tuning datastore clusters (Elasticsearch, RDS, MySQL, Aerospike, etc)
  • Knowledge of messaging systems such as Kafka, RabbitMQ, SQS, etc
  • Experience with an always-on and high-volume web server stack (nginx, HAProxy, squid, etc)

This role is based in New York, but we are open to candidates being remote.

#PromoteIQ #MicrosoftAdvertising

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
