Luxoft is a global leader in high-end software development.
Luxoft is looking for talents with a passion for technology & ready to create original solutions. Once on board, you are invited to expand your knowledge & skills, offering you a continuous learning experience helping you stretch your potential.
So if you’re enthusiastic by the idea of accessing cutting edge technology & innovation to make an impact, why don't you join us?
Capacity Management consists of processes to ensure that front to back IT systems are better equipped to handle volatile market volumes. Given the client, regulatory and internal audit focus on systems capacity, IB wide Global Capacity Management is now a formal mandate and a dedicated program has been launched to formalize and institutionalize common approach to capacity management. This group would be responsible for delivering proactive Capacity Management capability across all aspects of IT Infrastructure to ensure that the capacity of IT services meets the evolving demands of the business in a cost-effective and timely manner
In this role, the potential candidate will help the Service Continuity and Capacity Teams in a new function, Capacity Analyst and Operator.
In this function, the candidate will be monitoring, analyzing, risk mitigating and escalating Resiliency and Capacity and Performance Alerts ("Insights") affecting Application Infrastructure estate, via the Operational Risk Engine. Specifically:
- Understand key capacity and performance metrics for each application
- Understand Infrastructure components such as Storage, Databases, Servers, Datacenter and Virtual Infrastructure
- Examine the Insights generated by the rules engine and determine if a problem ticket should be raised within the ticketing system. The Rules Engine provides automation & an interface to facilitate the decision process. The candidate will use their knowledge of Capacity and Resiliency as part of the decision making process.
- Able to dissect from raw data, trends and correlate the information with application releases
- identify potential performance constrains caused by abnormal behaviors
- Interact with stakeholders helping them to mitigate risks caused by abnormal behaviors in the Performance of the environment, providing root cause analysis, recommendations and best practices
- Identify and document issues, gaps, and concerns of the users
- Build a list of requirements needed to address these issues such as calibration of alerts and improvements based on empirical data.
- Create Reports and KPIs detailing the findings from the analysis.
- The candidate will be expected to provide periodic feedback, such that processes and procedures can be improved
The candidate should also be well versed with data manipulation preferably the use of Excel, Tableau and any other analytics tool.
- Understand the Service Continuity and Capacity Management program goals, requirements, work flow and ensure its successful execution
- Knowledge of IT Capacity Management and Infrastructure required (datacenter, Storage, servers, databases)
- Production systems knowledge, background knowledge of critical IT infrastructure, good communication and presentation skills, ability to plan, manage/Coordinate projects. ITIL certification and familiarity with IT Systems Architecture a plus, also operations or application support experience a plus.
- Proven experience with Data Analytics
- Disaster Recovery coordination, background knowledge of critical IT infrastructure, good communication and presentation skills, ability to plan, manage/Coordinate projects. ITIL certification and familiarity with IT Systems Architecture a plus.
- At least 4 years' experience as an IT-Data Analysis and/or Support in large and complex projects, preferably in the banking/finance industry
- Strong problem solving skills
- Familiar with Financial Institutions
- Excellent communication/presentation (oral and written) and report writing skills
- Ability to pay attention to details and accuracy
- High degree of flexibility with a "can do" attitude, able to work effectively with limited level of instruction and supervision
- Excellent time management and task management abilities
- Proactive, driven and energetic Capability of handling pressure situation with ease
- Self-motivated person to research on his/her own any technical issue/limitation to be able to walk around and resolve issues.
Technical Skills
- A good understanding of Performance and Capacity Management, especially, on the supporting Infrastructure components such as servers, databases, storage, networks.
Intermediate/Advanced level
- Knowledge of IT Capacity Management and Infrastructure required (datacenter, Storage, servers, databases)
- Strong MS Office skills
- Visualization and Reporting experience, preferably with tools such as Visokio or Tableau a plus
- Very good experience with SQL Server capable of creating complex queries for Data Extraction and Data Transformation, a plus
Enhance existing Operational Risk management processes to enable Production Support teams and Application Owners to identify and action potential capacity and resiliency issues before they actually occur.
Implement changes to Operational Risk management processes for critical applications that will meet existing mandatory government regulatory requirements