cover image
Cohere

Cohere

cohere.com

20 Jobs

780 Employees

About the Company

Cohere is the leading security-first enterprise AI company. We build cutting-edge foundation AI models and end-to-end products designed to solve real-world business problems. We partner closely with companies to deliver seamless integration, deep customization, and easy-to-use solutions for their workforce. Our all-in-one platform offers enterprises the highest levels of data security, privacy and optionality to deploy across all major cloud providers, private cloud environments, or on-premises.

Listed Jobs

Company background Company brand
Company Name
Cohere
Job Title
Software Engineer Intern (Winter 2026)
Job Description
Job Title: Software Engineer Intern (Winter 2026) Role Summary: Assist in building and scaling machine‑learning infrastructure and API features that drive AI‑powered products. Intern will own end‑to‑end development tasks, from designing data pipelines to deploying production services, with a focus on reliability, performance, and user impact. Expectations: - Complete full‑stack coding tasks on time and maintain high code quality. - Own feature development from design through production and post‑deployment monitoring. - Demonstrate ability to learn rapidly, adopt new tools, and contribute to team knowledge bases. - Communicate progress clearly via documentation and stand‑ups. - Engage with mentors and cross‑functional teams to validate design and implementation decisions. Key Responsibilities: - Develop browser and server‑side code for user‑facing products. - Build and maintain API features that directly serve external customers. - Design, implement, and optimize data pipelines (web crawlers, ingestion, filtering, storage). - Architect scalable services or infrastructure components for machine‑learning development. - Create and maintain internal tooling (CI/CD pipelines, developer utilities, monitoring dashboards). - Write clear, maintainable technical documentation for internal and external stakeholders. - Stay current with emerging technologies to improve system performance and reliability. Required Skills: - Proficient programming in one or more of: Python, Go, JavaScript/TypeScript. - Experience with REST/GraphQL API design and implementation. - Familiarity with distributed data processing (e.g., Apache Beam, Spark, or similar). - Knowledge of cloud services (AWS, GCP, Azure) and container orchestration (Docker, Kubernetes). - Understanding of CI/CD principles and tooling (Git, GitHub Actions, Jenkins, or equivalent). - Ability to write clean, well‑tested code and maintain documentation. - Strong problem‑solving, debugging, and performance optimization skills. - Excellent written and verbal communication; comfortable collaborating across teams. Required Education & Certifications: - Currently enrolled in an undergraduate or graduate program in Computer Science, Electrical Engineering, Data Science, or a related STEM field. - Coursework or project experience in machine‑learning, natural‑language processing, or large‑scale data systems is strongly preferred. - No specific certifications required, but familiarity with relevant cloud provider certifications (e.g., AWS Certified Developer) is a plus.
San francisco, United states
On site
Fresher
26-12-2025
Company background Company brand
Company Name
Cohere
Job Title
AI Data Trainer, English Language
Job Description
Job Title: AI Data Trainer, English Language Role Summary Independent contractor position (16 hrs/week, remote, Canada only). Responsible for labeling, ranking, auditing, and correcting machine learning data to improve large‑language‑model performance. Works solo with minimal supervision, using provided guidelines to maintain data integrity and quality. Expectations - Full‑time availability of 16 hours per week. - Bring own laptop and reliable internet. - Complete tasks accurately and on schedule. - Participate in written and video assessments to demonstrate English skills. - Sign a contractor agreement; maintain confidentiality of project data. Key Responsibilities 1. Label and rank language data across formats (text, JSON, CSV, TSV, Markdown). 2. Audit outputs for errors and discrepancies, correcting data where necessary. 3. Complete reading‑based assignments quickly and precisely. 4. Execute preference‑based tasks by selecting responses that best adhere to project guidelines. 5. Identify opportunities to enhance data quality and model performance, proposing optimization ideas. 6. Provide clear, concise feedback to cross‑functional teams on language‑related issues. Required Skills - Native or near‑native English proficiency (reading, writing, editing). - Expert reading and writing ability; proofreading skills. - Basic experience manipulating JSON, TSV/CSV, and Markdown files. - Exceptional attention to detail; commitment to accuracy. - High tolerance for repetitive, monotonous work. - Ability to follow complex instructions and work independently. - Strong time‑management and urgency. - Curiosity about technology and creative problem‑solving. - Reliable laptop and high‑speed internet connection. Required Education & Certifications - High school diploma or equivalent (no specific certification mandatory). ---
Calgary, Canada
On site
29-12-2025
Company background Company brand
Company Name
Cohere
Job Title
Staff Research Engineer, Model Efficiency
Job Description
**Job Title:** Staff Research Engineer, Model Efficiency **Role Summary:** Focus on advancing Large Language Model (LLM) inference efficiency to enhance model performance and scalability for AI applications. Drive innovation across model architecture, algorithm optimization, and software/hardware integration. **Expectations:** PhD in Machine Learning or a related field; expertise in optimizing LLM inference under resource constraints; experience with efficiency-enhancing techniques (e.g., MoE routing, decoding algorithms); strong software engineering capabilities; adaptable to fast-paced, ambiguous environments; mentorship and publication track record. **Key Responsibilities:** Develop and deploy techniques to improve model execution efficiency; optimize architecture and inference-time algorithms; design software/hardware co-strategies for GPU acceleration; ensure performance gains without sacrificing model quality. **Required Skills:** Proficiency in LLM architecture and optimization; expertise in model efficiency methods; advanced software engineering skills; ability to conduct research and implement solutions in dynamic settings. **Required Education & Certifications:** PhD in Machine Learning, Computer Science, or related discipline; publications in top-tier conferences (e.g., ICLR, NeurIPS).
Toronto, Canada
On site
12-01-2026
Company background Company brand
Company Name
Cohere
Job Title
Member of Technical Staff, Model Efficiency
Job Description
**Job Title** Member of Technical Staff, Model Efficiency **Role Summary** Design, implement, and optimize high‑performance inference solutions for large language models (LLMs). Collaborate with modeling and systems teams to identify bottlenecks, develop performance improvements, and ship production‑ready enhancements that reduce latency and increase throughput across diverse workloads. **Expectations** - 5+ years of experience developing production‑grade, high‑performance code. - Proven ability to ship features quickly, measure impact, and iterate. - Strong bias toward action and problem resolution. **Key Responsibilities** - Analyze end‑to‑end model execution stack to locate performance bottlenecks. - Engineer optimizations at the system, kernel, and algorithmic levels (e.g., GPU/CUDA, memory layout, KV‑cache, MoE routing). - Experiment with and benchmark new inference techniques (speculative decoding, dynamic batching). - Collaborate with model developers to align architectural changes with inference efficiency goals. - Write clean, maintainable code in C++ or Python (Rust/Go experience is a plus). - Document performance findings and share best practices with the broader engineering community. **Required Skills** - Expert programming in C++ or Python; proficiency in Rust or Go is welcome. - Deep understanding of LLM inference ecosystems (vLLM, SGLang, etc.). - Strong systems‑level troubleshooting and profiling skills. - Experience with GPU programming, CUDA, or low‑level systems optimization. - Familiarity with transformer‑based language modeling, MoE, and KV‑cache optimization. - Comfortable working in distributed, performance‑critical environments. **Required Education & Certifications** - Bachelor’s degree in Computer Science, Electrical Engineering, or a related field (advanced degrees preferred). - No mandatory certifications, but experience with industry‑standard profiling and benchmarking tools is expected.
Toronto, Canada
On site
Mid level
12-01-2026