Datadog

datadoghq.com

8 Jobs

8,418 Employees

About the Company

Datadog is the essential monitoring platform for cloud applications. We bring together data from servers, containers, databases, and third-party services to make your stack entirely observable. These capabilities help DevOps teams avoid downtime, resolve performance issues, and ensure customers are getting the best user experience.


Listed Jobs

Company Name
Datadog
Job Title
Senior Software Engineer (MLOps) – Serving
Job Description
Job Title: Senior Software Engineer (MLOps)

Role Summary: Design, implement, and scale end-to-end infrastructure for serving machine learning and large-language-model inference across distributed data centers, ensuring high availability, observability, and performance for production workloads.

Expectations: Lead architecture and development of distributed inference systems; collaborate across infrastructure, observability, and ML engineering; deliver self-service tooling for applied scientists; maintain SLAs and optimize for multi-GPU and heterogeneous compute environments.

Key Responsibilities:
- Architect and build scalable ML/LLM model serving pipelines with Ray or equivalent frameworks.
- Design inference infrastructure for low- to high-throughput workloads, including CPU and GPU deployment strategies.
- Enable self-service CI/CD, rollback, and shadow traffic for model lifecycle management.
- Implement A/B testing and shadow deployments to evaluate new model versions.
- Collaborate with platform teams on GPU provisioning, traffic routing, and runtime performance tuning.
- Instrument inference workflows with comprehensive telemetry (latency, tokens, errors) for performance and safety analytics.
- Troubleshoot and optimize distributed system performance and reliability.

Required Skills:
- 6+ years backend/infrastructure engineering; 2+ years on ML/AI platforms.
- Proven experience building distributed systems for model serving or large-scale APIs.
- Strong proficiency in Python, Go, or comparable systems languages.
- Experience with Ray, TorchServe, Triton, or BentoML.
- Hands-on knowledge of GPU compute and heterogeneous workload orchestration.
- Solid understanding of performance tuning, observability, and CI/CD pipelines.
- Familiarity with AI observability, rollback strategies, or deployment proxies is a plus.

Required Education & Certifications:
- Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience).
Valbonne, France
On site
Senior
12-11-2025
Company Name
Datadog
Job Title
Software Engineer with Systems Depth
Job Description
Job Title: Software Engineer – Systems Infrastructure

Role Summary: Design, build, and maintain scalable, high-performance infrastructure that supports millions of customers and high-volume, low-latency workloads. Deliver reliable, cost-efficient services, troubleshoot and resolve bottlenecks, automate operations, and collaborate with developers on data modeling and datastore selection.

Expectations:
- Own infrastructure delivery from design to production, ensuring minimal downtime and optimal cost.
- Actively address production issues, conducting deep root-cause analyses.
- Maintain 24/7 availability for critical services and datastores.
- Champion simplicity, reliability, durability, and scalability in all solutions.

Key Responsibilities:
- Architect and implement infrastructure components to scale with exponential growth.
- Diagnose and fix performance bottlenecks across platform code and client applications.
- Provide 24/7 support for owned services and databases.
- Develop tooling and automation that reduce manual intervention and improve observability.
- Partner with developers to design data models and evaluate datastore options.

Required Skills:
- Bachelor’s, Master’s, or Ph.D. in Computer Science, Engineering, or related field, or at least 5 years of professional experience.
- Production experience with distributed systems (e.g., Zookeeper, Cassandra, Postgres, FoundationDB, Kafka, Elasticsearch, Redis, MongoDB).
- Proficiency in one or more programming/scripting languages (Go, Java, Rust, C/C++).
- Strong ability to design simple, performant architectures for large-scale problems.
- Familiarity with AI tools for development or eagerness to adopt them.
- Bonus: Hands-on experience with a major cloud provider (AWS, Azure, GCP).

Required Education & Certifications:
- Bachelor’s degree in Computer Science, Engineering, or a quantitatively related field, or equivalent professional experience.
- No mandatory certifications; relevant technical certifications (e.g., cloud provider credentials) are a plus.
Bordeaux, France
On site
Mid level
30-11-2025
Company Name
Datadog
Job Title
Software Engineer - Backend Generalist
Job Description
**Job Title:** Software Engineer – Backend Generalist

**Role Summary:** Design, develop, and maintain highly scalable backend services for a global observability platform. Own the end-to-end service lifecycle, from architecture and scaling to deployment and incident response, while collaborating with cross-functional teams to deliver production-grade features.

**Expectations:**
- Deliver performant, reliable code that scales to millions of requests per day.
- Own services end to end, from design through production support and optimization.
- Communicate clearly with teammates and stakeholders in a fast-moving, high-growth environment.
- Embrace AI-assisted development tools and stay current on emerging backend technologies.
- Contribute to continuous improvement of processes, toolchains, and best practices.

**Key Responsibilities:**
- Identify and resolve scaling bottlenecks in critical production services.
- Design and implement scalable architecture for new and existing services.
- Deploy new features using feature-flagged rollouts and monitor impact.
- Investigate, triage, and fix production incidents in owned services.
- Collaborate with product, data, and operations teams to prioritize upcoming projects and define success metrics.
- Conduct code reviews, enforce coding standards, and mentor junior engineers.
- Maintain and evolve CI/CD pipelines and automated testing suites.

**Required Skills:**
- 3+ years of production backend development experience.
- Expertise in at least one language (e.g., Go, Java, Python, Rust).
- Strong understanding of distributed system design, microservices, and API patterns.
- Experience with performance tuning, load testing, and metric-driven optimization.
- Familiarity with CI/CD, container orchestration (Kubernetes), and cloud infrastructure.
- Ability to use feature-flag systems and analyze feature rollout data.
- Proficiency in debugging across distributed stacks (traces, logs, metrics).
- Commitment to clean, maintainable code and solid testing practices.
- Comfortable leveraging AI tools for code generation, review, and problem solving.

**Required Education & Certifications:**
- Bachelor’s, Master’s, or Ph.D. in Computer Science, Engineering, or a related scientific field, or equivalent professional experience.
- No mandatory certifications required.
Paris, France
On site
04-12-2025
Company Name
Datadog
Job Title
Director, Major Accounts
Job Description
**Job Title**: Director, Major Accounts

**Role Summary**
Lead a high-performance team of Major Account Managers to drive new business and expand revenue within the largest, most strategic Enterprise clients. Set and achieve aggressive booking targets, develop go-to-market strategy, and ensure effective collaboration across Sales, Marketing, Product, and Success functions.

**Expectations**
- Deliver annual Enterprise bookings quota with consistent monthly/quarterly performance.
- Scale and coach a team, driving win rate, deal size, and forecast accuracy.
- Navigate complex, multi-stakeholder sales cycles and negotiate pricing based on business value.
- Maintain strong executive relationships in Fortune 1000 accounts.
- Travel up to 70% of the time (auto, train or air) and be onsite with clients as needed.

**Key Responsibilities**
- Recruit, hire, train, and ramp Major Account Managers.
- Set quarterly/monthly quotas, monitor productivity metrics, and conduct forecast meetings.
- Coach managers on proactive sales tactics, executive engagement, and negotiation.
- Define and execute regional go-to-market strategy; collaborate with Marketing and Product to craft targeted messaging and collateral.
- Participate in and lead client and prospect meetings; serve as escalation point for complex deals.
- Work cross-functionally with Success to map customer journeys and identify upsell opportunities.
- Provide regular updates to leadership on pipeline health, revenue forecasts, and team performance.

**Required Skills**
- B2B technology enterprise sales experience (≥5 years) with Fortune 1000 customers.
- Proven leadership of high-performing sales teams and quota setting/management.
- Strong relationship builder with C-level executives, skilled in complex negotiations and value-based pricing.
- Deep understanding of full sales cycle, from prospecting to closing.
- Excellent communication, coaching, and presentation skills.
- Ability to travel frequently and work with corporate resources.

**Required Education & Certifications**
- Bachelor’s degree in Business, Marketing, Engineering, or related field.
- Familiarity with MEDDIC or similar sales methodology preferred.
United Kingdom
Remote
Mid level
13-12-2025