USNLX Diversity Jobs

Epsilon, Inc. Site Reliability Engineer III in Kansas City, Missouri

Site Reliability Engineer III

Who is Epsilon:

Epsilon is an IT Services company that was founded in 2009 and has become an established leader in providing Information Technology services to both Federal Government and Commercial businesses across the United States. Epsilon is known for its solution-focused and innovative approach, aligning technology systems, tools, and processes with the missions and objectives of its customers.

Epsilon’s headquarters are in Weaverville, NC, with other corporate offices in Greenville, SC; Crystal City, VA; and Denver, CO. We have employees in 30+ states across the U.S.

Why work for Epsilon:

In joining Epsilon’s team, you will have the opportunity to contribute to Epsilon’s business and customer initiatives, as well as influence our brand culture through people interaction and technology advancements.

Epsilon invests in our employees by promoting from within and by allocating $3,000 annually in Professional Development funds so that employees can elevate their knowledge and skill set in their profession. We also offer competitive pay, comprehensive benefits through one of the largest national carriers, Paid Time Off (PTO) that increases with tenure and has a generous rollover, 11 company-paid holidays, and a 401(k) with immediate contribution.

Where you’ll work:

This fully remote opportunity allows you the flexibility to work from home in support of Epsilon’s USDA DISC Customer.

Our Customer’s Mission:

The USDA Digital Infrastructure Services Center (DISC) operates 24/7/365 to provide comprehensive on-premises and cloud-based hosting services, including Disaster Recovery, security, and professional support services and operations, to approximately 35 federal organizations. The USDA and other Federal partners depend upon DISC’s highly complex and interconnected technology infrastructure to conduct their operations. To better support this mission, DISC is modernizing its technology by transitioning to an organization built on continuous integration, continuous deployment, and code-based practices.

An average day:

As a Site Reliability Engineer III (SRE), you will leverage IT expertise and automation tools to monitor and ensure software reliability in the production environment. You will identify and resolve software issues through coding, maintain system stability, and optimize performance by blending technical expertise in operations with software development. In this position you will:

  • Develop and maintain Infrastructure as Code (IaC) using tools such as Packer, Artifactory, Ansible, and Terraform.

  • Manage and optimize Continuous Integration and Continuous Deployment (CI/CD) pipelines, with a preference for GitLab.

  • Ensure adherence to Source Control best practices, particularly using GitLab.

  • Create and maintain scripts in languages like Bash, PowerShell, and Python to automate tasks and enhance system reliability.

  • Collaborate with team members and cross-departmental partners to establish and maintain SRE practice in an Agile Scrum framework.

  • Participate in system design reviews to identify points of failure and to promote automation and self-healing.

  • Participate in code reviews to ensure efficiency, testability, and scalability.

  • Participate in incident management ceremonies to analyze root causes and identify steps that mitigate future occurrences and reduce downtime.

  • Create and maintain relevant documentation for systems and processes.

Basic Qualifications:

  • As a requirement of this position, all candidates must be U.S. citizens. In accordance with 8 U.S.C. 1324b(a)(2)(C), Epsilon will not consider candidates for this position who do not meet the aforementioned condition.

  • Bachelor’s degree in Information Technology, Software Development, or a related field, or equivalent professional experience.

  • 8+ years of experience in IT administration, software engineering, or platform engineering, with a focus on datacenter infrastructure and enterprise systems.

  • 1+ year of experience deploying and managing enterprise cloud technologies, with a focus on AWS or Azure.

  • 4+ years of experience using CI/CD and IaC tools, such as Terraform, Ansible Automation Platform, GitLab, Artifactory, and Packer.

  • Strong proficiency in scripting languages (Python, PowerShell, Bash) and automation technologies, with a preference for Python.

  • High proficiency in Windows and Linux operating systems, including networking concepts and troubleshooting.

  • Robust analytical and troubleshooting skills, with the ability to identify solutions to complex problems and think logically through challenging situations.

  • Experience in mentoring roles, particularly in guiding teams through complex technical challenges and promoting a DevOps culture.

  • Proficiency working within Agile methodologies (Scrum, Kanban, SAFe) and the software development lifecycle (SDLC) process.

  • Demonstrated experience in designing, deploying, and supporting cloud-based systems, with knowledge of monitoring, logging, security, and scalability.

  • Strong communication skills, with the ability to collaborate effectively with cross-functional teams and stakeholders, and a high level of integrity and accountability.

  • Strong written and oral communication skills in the English language. Must be able to read, write, speak, and understand English.

  • Ability to communicate applicable technical subject matter expertise to management and others.

  • Demonstrated experience with the ITIL framework:

      • ITIL v4 Foundation knowledge.

      • Ability to apply and provide feedback on service operation model and practices.

Other Requirements:

  • Must be able to pass a federal background investigation and obtain a Public Trust.

Physical Demands and Working Conditions:

Listed below are the physical or mental requirements necessary for the job's performance. Reasonable accommodation may be made to enable individuals with disabilities to perform essential job functions:

  • Prolonged periods of computer desk work.

  • Dexterity of hands and fingers to operate a computer keyboard and other computer components.

  • Speaking and hearing sufficient to converse and understand conversations in person, by telephone, and in virtual meetings.

  • The cognitive skills needed to complete tasks, including abilities such as learning, remembering, focusing, categorizing, and integrating information for decision-making, problem-solving, and comprehending.

  • Ability to learn new tasks, remember processes, maintain focus, complete tasks independently, make timely decisions in the context of a workflow, and the ability to communicate with managers and co-workers.

  • Mental aptitude to respond appropriately in high-pressure situations or deadline-driven environments.

  • Maintain a professional emotional response when working with others.

Connect directly with your dedicated recruiter, Jessica, on Epsilon’s careers page.

www.epsilon-inc.com/careers

Epsilon is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applications will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status. EEO/AA: Minorities/Females/Disabled/Vets.

To review your rights under EEO policy, please see https://www.eeoc.gov/sites/default/files/2023-06/22-088_EEOC_KnowYourRights6.12ScreenRdr.pdf.

If you are an individual with a disability and need special assistance or reasonable accommodation in applying for employment with Epsilon, Inc., please contact our Recruiting department by phone at 828-398-5414 or by email at careers@epsilon-inc.com.

We will be accepting applications through 9/18/2024.
