Junior Site Reliability Engineering Job at Jobright.ai, Atlanta, GA

eVg3QnVFaVlhcEFJYjNXN25ZU2w3YmNk
  • Jobright.ai
  • Atlanta, GA

Job Description

Join to apply for the Junior Site Reliability Engineering role at Jobright.ai 2 days ago Be among the first 25 applicants Join to apply for the Junior Site Reliability Engineering role at Jobright.ai Jobright is an AI-powered career platform that helps job seekers discover the top opportunities in the US. We are NOT a staffing agency. Jobright does not hire directly for these positions. We connect you with verified openings from employers you can trust. Job Summary: Geotab is a global leader in IoT and connected transportation, known for its diverse and innovative work culture. The Site Reliability Engineer will ensure the availability, reliability, and performance of Geotab's core products, acting as a primary escalation point for critical application issues and collaborating with various technical teams to restore service and improve system stability. Responsibilities:

  • Act as a primary escalation point for critical production application/product issues.
  • Rapidly troubleshoot complex problems across the application stack, utilizing observability tools to identify root causes.
  • Coordinate effectively with development, infrastructure, and other technical teams during incidents to implement fixes and restore service swiftly.
  • Clearly communicate incident status, impact, and resolution steps to internal stakeholders.
  • Collaborate with team members to improve monitoring tools, dashboards, and alerting mechanisms for proactive detection of issues impacting Critical User Journeys (CUJs) within the application/product and computing architecture. Our complex environment encompasses monolithic applications, microservices, and a vast ecosystem of millions of hardware units.
  • Monitor application/product and system health proactively using a combination of tools to ensure high availability and adherence to Service Level Objectives (SLOs) / Service Level Agreements (SLAs).
  • Identify opportunities and implement automation tools/scripts to streamline routine operational tasks, reduce manual effort (toil), and improve response times.
  • Conduct system tests to validate performance, reliability, and successful remediation of issues.
  • Recommend design and process enhancements based on operational experience to improve overall application reliability and maintainability.
  • Participate in post major incident reviews (PMIRs) to analyze disruptions, document findings, track corrective actions to prevent recurrence, and identify areas of improvement for incident response processes.
  • Contribute to building a culture of learning from incidents.
  • Participate in a 24x7 on-call rotation to provide timely support for critical issues outside of business hours.
Qualifications: Required:
  • 3 - 5 years experience in SRE/DevOps/Tier 3.
  • Strong troubleshooting skills with a systematic problem-solving approach.
  • Extensive experience resolving critical incidents in production environments.
  • Strong proficiency in Linux and operational scripting (Bash, Powershell, Python).
  • Experience with database/dataset querying (GoogleSQL, PostgreSQL, BigData), automated configuration management (via tools like Ansible), and GitOps tools (Argo CD).
  • Experience with data visualization platforms (e.g., Apache Superset/BigQuery Visualizations).
  • Familiarity with cloud platforms (GCP/Azure/AWS), container orchestration (Kubernetes), and monitoring/alerting systems (e.g., Prometheus stack including AlertManager/Grafana).
  • Understanding of application environments (e.g., .NET/C#) for troubleshooting purposes.
  • Understanding of fundamental networking concepts (TCP/IP, DNS, Load Balancing) are considered assets.
  • Familiarity with applying AI-powered tools to enhance operational efficiency in areas such as log analysis, troubleshooting assistance, incident summarization, and automation scripting.
  • Demonstrated ability to work well under pressure and manage multiple tasks and projects simultaneously.
  • Experience with incident management processes.
  • Experience working within a technical or engineering organization with knowledge of the high-technology industry is considered an asset.
  • Excellent verbal and written communication skills.
  • Strong analytical skills with the ability to problem solve and develop well-judged decisions.
  • Strong team player with the ability to engage with all levels of the organization.
  • Technical competence using software programs, including but not limited to, Google Suite for business (Sheets, Docs, Slides) or equivalents.
  • Entrepreneurial mindset and comfortable in a flat organization.
Company: Geotab is a provider of secure Open Platform telematics technology for GPS fleet management. Founded in 2000, the company is headquartered in Oakville, Ontario, CAN, with a team of 1001-5000 employees. The company is currently Late Stage. Seniority level Seniority level Entry level Employment type Employment type Full-time Job function Industries Software Development Referrals increase your chances of interviewing at Jobright.ai by 2x Inferred from the description for this job Medical insurance Vision insurance 401(k) Get notified about new Site Engineer jobs in Atlanta, GA . Atlanta, GA $70,000.00-$85,000.00 1 month ago Engineer - Embassy Suites by Hilton Atlanta Buckhead Entry-level Civil or Environmental Engineer Atlanta, GA $60,000.00-$70,000.00 2 days ago Atlanta, GA $95,000.00-$120,000.00 2 weeks ago Director of Residential Engineering - Civil Site Development Norcross, GA $50,000.00-$55,000.00 1 week ago Atlanta, GA $88,000.00-$132,000.00 1 week ago We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI. #J-18808-Ljbffr Jobright.ai

Job Tags

Full time,

Similar Jobs

Northwell Health

Diabetes Educator Job at Northwell Health

 ...health and control of disease. Provides nutritional assessment, education and counseling to individuals and/or groups. Serves as a...  ..., required. Registered Dietitian, required. Current Diabetes Educator Certification, required. OR Bachelors... 

Yochana

Datacenter Technician Job at Yochana

 ...understanding of networking concepts. Familiarity with IT asset management tools and inventory systems. Certifications such as CompTIA Server+, Cisco CCNA, or equivalent are a plus. Will be added advantage. At least 3-4 years of experience in data centers... 

Headway

Licensed Psychiatric Nurse Practitioner (Virtual) Job at Headway

" Licensed Psychiatric Nurse Practitioner Wage: Between $155-$203 an hour Did you know that you can build a flexible private practice on your terms as a psychiatric nurse practitioner?Whether you want to see patients alongside a full-time job or grow a full-...

Arup

Head of Digital Marketing and Social Media - Global (Boston) Job at Arup

 ...increasingly competitive marketplace, our Marketing, Communications, and Business...  ...ambitions. We are seeking to hire a Head of Digital Marketing and Social Media - Global to strengthen...  ...marketing automation and marketing data management capabilities. This is a critical role... 

Northwestern Mutual - Fairfield County

Client Advisor, Seeking Athletes Job at Northwestern Mutual - Fairfield County

 ...network offices. In 2024, the firm had 5 advisors ranked in the Top-10 of the Forbes Best-In-State Financial Securities List (CT) and...  ...and as a bus boy/dishwasher in high school. Passionate about? Golf, snowboarding, scuba diving, and Boston sports, especially football...