Vacancy caducado!
Software Guidance & Assistance, Inc., (SGA), is searching for a Remote Infrastructure Engineerfor a Contract-to-Permassignment with one of our premier Healthcare Servicesclients. W2 ONLY - NO 3RD PARTIES 6 Month contract - right to hire Open to candidates in all US time zones. 100% remote (will NOT be required to work onsite even when converted to fte) Responsibilities :
- Participate in short- and long-term planning efforts with stakeholders and IT groups.
- Identify and communicate how IT infrastructure solutions can support the achievement of short- and long-range business goals.
- Support the prioritization of requirements and help match resources with requirements.
- Work with analysts, architecture, and stakeholders to understand business needs.
- Lead the evaluation of technical requirements for projects to determine the impact to the infrastructure including equipment redundancy and capacity requirements.
- Ensure completeness of technical requirements and functional architecture analysis for the design and implementation of system business solutions.
- Identify requirements gaps or issues.
- Determine systems specifications, input/output processes and working parameters for hardware/software compatibility.
- Determine requirements impact on existing architecture, work processes and systems.
- Evaluate technical requirements for projects to determine the impact to infrastructure/applications including equipment redundancy and capacity requirements.
- Determine technical requirements' impact on existing architecture, work processes, systems, and ongoing support.
- Explain to non-SMEs how the proposed solution will support their requirements.
- Assist in the business process redesign and documentation as needed for new technology.
- Lead the Architecture, design, development and test of technical solutions or infrastructure solutions to meet business requirements and functional specifications.
- Ensure that tests evaluate all possible impacts on the current infrastructure.
- Coordinate and lead the build and deployment and review of new, modified or enhanced infrastructure components or services.
- Ensure all support documentation knowledge transfer to production support.
- Verify the functionality of components and services and ensure deployment meets client's expectations.
- Establish requirements, methods and procedures for routine maintenance.
- Ensure performance meets the present and future needs of the business.
- Forecast utilization patterns and identifies modifications or upgrades.
- Recommend changes/enhancements for improved systems availability, reliability and performance.
- Develop and maintain metrics around the system and institutes a process for continuous improvement.
- Conduct reviews periodically with users and vendors.
- Define and ensure continuous monitoring procedures are set according to the standard procedures and requirements.
- Create a plan to evolve the system to reduce cost and improve system dynamics.
- Perform or coordinate Level 3/4 incident assessment and resolution on infrastructure solutions.
- Coordinate problem management and resolution among a variety of functional areas and provides subject matter expertise support for diagnosing and resolving problem.
- Recommend procedures and controls for problem resolution or create temporary solutions until permanent solutions can be implemented.
- Research, analyze and recommend the implementation of software or hardware changes to rectify any current or similar future problems.
- Review checklists and scripts and update as needed.
- Lead the development of contingency plans including reliable backup and restore procedures.
- Identify business continuity/disaster recovery risks and mitigation plans.
- Assist in the development of disaster recovery plans with service providers and network carriers.
- Support and establish systems environment standards.
- Work with auditors and security groups to ensure adherence to governance, regulations, and compliance with policies and procedures.
- Evaluate vendor solutions to ensure compliance with requirements and cost-effectiveness.
- Evaluate future technologies and makes recommendations.
- Review vendor proposals for new infrastructure solutions.
- 7+ years of experience in deploying, maintaining, and troubleshooting complex applications at an enterprise-scale
- 5-10 years of relevant experience in an Engineering & IT lead role
- Ability to converse with application owners, architects, performance testers to collect requirements, pinpoint application performance bottlenecks via the monitoring & observability tools, and deliver enterprise-scale solutions
- Drive operational efficiencies, process change, and improvements in a challenging environment while maintaining good working relationships and overcoming resistance to change
- Demonstrate comprehensive knowledge of design metrics, analytics tools, benchmarking activities, and related reporting to identify and implement best practices
- Provide strong leadership that utilizes clear and concise written and verbal communication to an audience that can vary from C-level execs to individual contributors in support teams
- Demonstrate comprehensive knowledge of design metrics, analytics tools, benchmarking activities, and related reporting to identify and implement best practices
- Experience working with diverse stakeholders, including operations, application developers, SMEs and performance testing in a consultative manner
- Strong knowledge of systems, networks, hardware, and software from an automation & monitoring standpoint.
- Must have hands-on experience of at least 2+ years in three of six tools categories:
- Monitoring
- Application Performance Management
- Real user monitoring
- Infrastructure monitoring
- Network performance andfault management
- Log aggregation/analysis tools
- 5+ years monitoring and troubleshooting experience with APM tools (e.g., New Relic, AppDynamics, DynaTrace, Splunk, Aternity, SolarWinds, or comparable platforms)
- 2+ years working ELK, Splunk, New Relic (or alternative log analysis/analytics tools)
- 2+ years' experience with synthetic monitoring tools (ideally New Relic, Pingdom, or other synthetic monitoring tools)
- 2+ years' experience working with ServiceNow incident, problem, and change modules along with integration with enterprise management tool(s)
- Knowledge of and experience with monitoring via API, WMI, SNMP, SSH, and other programming languages and/or protocols
- Experience with Agile delivery utilizing modern cloud-based software tools and approaches (Git, CI/CD, JIRA, automated testing)
- Experience with Puppet, Puppet Bolt, or other orchestration tools
- Experience managing, tuning, and maintaining the right balance of alerting that enables/maintain our observability strategy while eliminating non-actionable issues
- Big Data / AIOPS experience is a strong plus to drive and support incident intelligence and observability