Hello,
We have an requirement for below position, kindly check and if interested then please share your updated copy of resume
Role: Site Reliability Engineer (SRE)
Location: Providence/ Rhode Island
Duration: Contract
Job Summary:
Seeking a highly skilled and experienced Senior Site Reliability Engineer (SRE) to design, build, and maintain scalable, resilient, and high-performing IT infrastructure. The ideal candidate will have deep expertise in cloud and hybrid environments, automation, observability, and infrastructure operations, with a strong focus on minimizing downtime and improving service reliability.
Key Responsibilities:
1. Infrastructure Design & Implementation
Design and implement highly available and resilient infrastructure solutions.
Ensure system scalability, fault tolerance, and disaster recovery across on-premise, cloud, and hybrid environments.
Define and maintain architecture standards and best practices for infrastructure design.
2. Service Reliability & Performance
Establish and maintain Service Level Objectives (SLOs) and Service Level Agreements (SLAs).
Ensure high availability and performance of critical systems and applications.
Collaborate with cross-functional teams to improve overall system reliability.
3. Incident & Problem Management
Proactively monitor and troubleshoot system and infrastructure issues to reduce Mean Time to Recovery (MTTR).
Lead Root Cause Analysis (RCA) for high-priority incidents (P1/P2) and drive preventive actions.
Maintain documentation for incident response procedures and recovery plans.
4. Automation & Infrastructure as Code (IaC)
Develop and implement Infrastructure-as-Code solutions using tools such as Terraform, Ansible, or similar.
Automate routine and repetitive operational tasks to reduce manual intervention and eliminate toil.
Manage version-controlled infrastructure and perform code reviews for IaC deployments.
5. Observability & Monitoring
Set up and manage monitoring and observability tools (e.g., Grafana, Prometheus, ELK stack, Azure Monitor).
Implement logging, metrics, and alerting to ensure visibility into the health and performance of systems.
Ensure early detection of anomalies and quick resolution of production issues.
6. DevOps & CI/CD Integration
Collaborate with DevOps teams to integrate CI/CD pipelines with infrastructure operations.
Support containerized environments using Docker and Kubernetes.
Maintain and optimize deployment processes for both infrastructure and applications.
7. Collaboration & Leadership
Act as a subject matter expert (SME) and provide guidance throughout the solution lifecycle.
Work closely with clients and stakeholders to understand business needs and design appropriate solutions.
Mentor junior engineers and participate in knowledge-sharing initiatives.
Required Skills & Qualifications:
Bachelor's degree in Computer Science, Information Technology, or a related field.
Proven experience managing and scaling IT infrastructure in cloud, on-premise, or hybrid environments.
Strong scripting/programming skills in Python, Bash, or PowerShell.
Proficiency in one or more cloud platforms: AWS, Azure, or GCP.
Expertise in automation tools: Terraform, Ansible, Chef, or similar.
Hands-on experience with observability and monitoring tools (e.g., Grafana, ELK, Prometheus).
Solid understanding of networking, virtualization, and storage technologies.
Familiarity with DevOps practices and container orchestration tools (e.g., Docker, Kubernetes).
Excellent troubleshooting, analytical, and problem-solving skills.
Thanks & Regards,
Rutuja Choudhari || US IT Recruiter
Nityo Infotech Corp
(609) 857-8238
rutuja.choudhari@nityo.com
If you feel you received this email by mistake or wish to unsubscribe, kindly reply to this email with "UNSUBSCRIBE" in the subject line
Hello,
We have an requirement for below position, kindly check and if interested then please share your updated copy of resume
Role: Site Reliability Engineer (SRE)
Location: Providence/ Rhode Island
Duration: Contract
Job Summary:
Seeking a highly skilled and experienced Senior Site Reliability Engineer (SRE) to design, build, and maintain scalable, resilient, and high-performing IT infrastructure. The ideal candidate will have deep expertise in cloud and hybrid environments, automation, observability, and infrastructure operations, with a strong focus on minimizing downtime and improving service reliability.
Key Responsibilities:
1. Infrastructure Design & Implementation
Design and implement highly available and resilient infrastructure solutions.
Ensure system scalability, fault tolerance, and disaster recovery across on-premise, cloud, and hybrid environments.
Define and maintain architecture standards and best practices for infrastructure design.
2. Service Reliability & Performance
Establish and maintain Service Level Objectives (SLOs) and Service Level Agreements (SLAs).
Ensure high availability and performance of critical systems and applications.
Collaborate with cross-functional teams to improve overall system reliability.
3. Incident & Problem Management
Proactively monitor and troubleshoot system and infrastructure issues to reduce Mean Time to Recovery (MTTR).
Lead Root Cause Analysis (RCA) for high-priority incidents (P1/P2) and drive preventive actions.
Maintain documentation for incident response procedures and recovery plans.
4. Automation & Infrastructure as Code (IaC)
Develop and implement Infrastructure-as-Code solutions using tools such as Terraform, Ansible, or similar.
Automate routine and repetitive operational tasks to reduce manual intervention and eliminate toil.
Manage version-controlled infrastructure and perform code reviews for IaC deployments.
5. Observability & Monitoring
Set up and manage monitoring and observability tools (e.g., Grafana, Prometheus, ELK stack, Azure Monitor).
Implement logging, metrics, and alerting to ensure visibility into the health and performance of systems.
Ensure early detection of anomalies and quick resolution of production issues.
6. DevOps & CI/CD Integration
Collaborate with DevOps teams to integrate CI/CD pipelines with infrastructure operations.
Support containerized environments using Docker and Kubernetes.
Maintain and optimize deployment processes for both infrastructure and applications.
7. Collaboration & Leadership
Act as a subject matter expert (SME) and provide guidance throughout the solution lifecycle.
Work closely with clients and stakeholders to understand business needs and design appropriate solutions.
Mentor junior engineers and participate in knowledge-sharing initiatives.
Required Skills & Qualifications:
Bachelor's degree in Computer Science, Information Technology, or a related field.
Proven experience managing and scaling IT infrastructure in cloud, on-premise, or hybrid environments.
Strong scripting/programming skills in Python, Bash, or PowerShell.
Proficiency in one or more cloud platforms: AWS, Azure, or GCP.
Expertise in automation tools: Terraform, Ansible, Chef, or similar.
Hands-on experience with observability and monitoring tools (e.g., Grafana, ELK, Prometheus).
Solid understanding of networking, virtualization, and storage technologies.
Familiarity with DevOps practices and container orchestration tools (e.g., Docker, Kubernetes).
Excellent troubleshooting, analytical, and problem-solving skills.
Thanks & Regards,
Rutuja Choudhari || US IT Recruiter
Nityo Infotech Corp
(609) 857-8238
rutuja.choudhari@nityo.com
If you feel you received this email by mistake or wish to unsubscribe, kindly reply to this email with "UNSUBSCRIBE" in the subject lin
Job Description Create an outstanding customer experience through exceptional service. Establish and maintain a safe and clean environment that encourages our customers to return. Assist the department manager in reaching sales and profit goals established for the department...
...one on one with your patient in the home health setting? If so, Private Duty Nursing might be a good fit for you!ChristianaCare HomeHealth... ...medications and prescribed treatments as orderedRequirements:DE LPN or RN or license with one year experience working in a healthcare...
Salary: $55 / HourThe Occupational Therapist (OT) is responsible forperforming student evaluations, developing and providing therapy services,... ...meaningful opportunities to our extensive network of healthcare and school-based professionals, ready to work in any hospital,...
...We are currently seeking a skilled B-Level Auto Body Technician to join our team. This individual will work in a fast-paced environment, handling collision repairs on a variety of vehicles. The ideal candidate will have a strong understanding of automotive repair procedures...
Nurse Apprentice Location Guntersville, AL : Job Summary Statement The following statements reflect the general duties considered necessary... ...may be inherent in the position. The Apprentice Nursing Student works under a supervising licensed nurse serving in the role of...