Job Description

Hello,

We have a requirement for the position below. If you are interested, please share an updated copy of your resume.

Role: Site Reliability Engineer (SRE)

Location: Providence, Rhode Island

Duration: Contract

Job Summary:

Seeking a highly skilled and experienced Senior Site Reliability Engineer (SRE) to design, build, and maintain scalable, resilient, and high-performing IT infrastructure. The ideal candidate will have deep expertise in cloud and hybrid environments, automation, observability, and infrastructure operations, with a strong focus on minimizing downtime and improving service reliability.

Key Responsibilities:

1. Infrastructure Design & Implementation

Design and implement highly available and resilient infrastructure solutions.
Ensure system scalability, fault tolerance, and disaster recovery across on-premises, cloud, and hybrid environments.
Define and maintain architecture standards and best practices for infrastructure design.

2. Service Reliability & Performance

Establish and maintain Service Level Objectives (SLOs) and Service Level Agreements (SLAs).
Ensure high availability and performance of critical systems and applications.
Collaborate with cross-functional teams to improve overall system reliability.

3. Incident & Problem Management

Proactively monitor and troubleshoot system and infrastructure issues to reduce Mean Time to Recovery (MTTR).
Lead Root Cause Analysis (RCA) for high-priority incidents (P1/P2) and drive preventive actions.
Maintain documentation for incident response procedures and recovery plans.

4. Automation & Infrastructure as Code (IaC)

Develop and implement Infrastructure-as-Code solutions using tools such as Terraform, Ansible, or similar.
Automate routine and repetitive operational tasks to reduce manual intervention and eliminate toil.
Manage version-controlled infrastructure and perform code reviews for IaC deployments.

5. Observability & Monitoring

Set up and manage monitoring and observability tools (e.g., Grafana, Prometheus, ELK stack, Azure Monitor).
Implement logging, metrics, and alerting to ensure visibility into the health and performance of systems.
Ensure early detection of anomalies and quick resolution of production issues.

6. DevOps & CI/CD Integration

Collaborate with DevOps teams to integrate CI/CD pipelines with infrastructure operations.
Support containerized environments using Docker and Kubernetes.
Maintain and optimize deployment processes for both infrastructure and applications.

7. Collaboration & Leadership

Act as a subject matter expert (SME) and provide guidance throughout the solution lifecycle.
Work closely with clients and stakeholders to understand business needs and design appropriate solutions.
Mentor junior engineers and participate in knowledge-sharing initiatives.

Required Skills & Qualifications:

Bachelor's degree in Computer Science, Information Technology, or a related field.
Proven experience managing and scaling IT infrastructure in cloud, on-premises, or hybrid environments.
Strong scripting/programming skills in Python, Bash, or PowerShell.
Proficiency in one or more cloud platforms: AWS, Azure, or GCP.
Expertise in automation tools: Terraform, Ansible, Chef, or similar.
Hands-on experience with observability and monitoring tools (e.g., Grafana, ELK, Prometheus).
Solid understanding of networking, virtualization, and storage technologies.
Familiarity with DevOps practices and container orchestration tools (e.g., Docker, Kubernetes).
Excellent troubleshooting, analytical, and problem-solving skills.

Thanks & Regards,

Rutuja Choudhari || US IT Recruiter

Nityo Infotech Corp

(609) 857-8238

rutuja.choudhari@nityo.com

If you believe you received this email by mistake or wish to unsubscribe, please reply to this email with "UNSUBSCRIBE" in the subject line.

Job Tags

Contract work
