Skip to main content

Reliability Production Engineers (RPE)

Full Time
Mumbai
Posted 2 years ago

As a member of the Production Management team we need you to bring your leadership skills and passion for technology to enable the team to operate more efficiently in a fast paced environment, and to help us provide best in class services to the firm’s clients across Business Units. The candidate should be adaptive to a continuous changing environment, be able to successfully multi-task, and enjoy the pressure and stress of a fully engaged production management role. This is a diverse role which provides great personal development opportunities and will suit an astute candidate with an appetite for responsibility and hands-on technology exposure. This is an exciting opportunity to help shape the next evolution of our Technology platform, working in conjunction with the businesses, developers, data operations, outside counterparties and regulatory bodies. Because this position provides exposure to a wide range of cutting edge technologies and is tightly integrated with our Agile based development teams/business sponsors, the exposure this role provides enables limitless career growth potential.

Primary Responsibilities:

  • Handle production management services including end user support, systems monitoring, incident management and problem management, plant management and event management.
  • Build extensive business and application knowledge required for supporting client facing applications.
  • Diagnose and resolve application issues to ensure optimal performance and usability.
  • Identify and implement automation to reduce toil, improve efficiency and eliminate customer impact.
  • Provide root cause analysis with recommendations for improvements.
  • Configure application monitors using industry standard monitoring tools, as well as developing customized monitoring solutions.
  • Interface with clients and other technology teams to provide governance and control around the production environment.
  • Manage / Drive outage calls and significant incidents; coordinate communications within a trade floor environment.
  • Act as a primary escalation/communication point between Application development teams and Business Units.
  • Initiate, grow and shape processes, procedures and strategies to make the team more efficient.

Required Skills:

  • Strong coding/scripting skills: Python / Perl / Shell (Any Two)
  • Strong debugging and problem solving skills.
  • Exposure to highly Distributed, High availability, Fault tolerant Systems.
  • Deep understating of Database Concepts, SQL Queries and Database Performance.
  • Experience working with application Monitoring/Alerting Tools (Splunk, AppDynamics, Elastic Search etc.).
  • Understanding of schedulers (Autosys, Control M etc.).
  • Experience in building and maintaining Production and/or Non-Production environments.
  • Experience in development tools (GIT, Jenkins, Ansible, Puppet etc.).

Good to have skills:

  • Cloud technologies such as AWS, Azure or GCP.
  • Working knowledge of any NoSQL databases.
  • Knowledge of Micro Service architecture, pub/sub Messaging Queues, Containerization.
  • Experience in financial services / products.

Good to have certifications:

  • 5+ Years:-
  • AWS Azure
  • Scrum master/Agile
  • Any Financial certifications like FRM/NSE
  • Python certificates PCAP/PCEP

HR Search Criteria for Candidates:

Must Have –

  • Production Support
  • SQL
  • Unix/Linux
  • Perl/Python/Shell Scripting
  • Good to Have
  • GIT / Jenkins / Ansible / Puppet / Docker / Chef
  • Splunk / Appdynamics

Job Features

Job CategoryInformation Technology

Apply Online

A valid phone number is required.
A valid email address is required.