Software Engineer, Applied ML - 2025 at Brave

London, England, United Kingdom -

Full Time

Start Date

Immediate

Expiry Date

22 Nov, 25

Salary

155000.0

Posted On

23 Aug, 25

Experience

2 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

Skills

Machine Learning, Pipeline Development, Tgi, Computer Science, Kubernetes, Scratch, Production Experience, Load Testing, Performance Tuning

Industry

Information Technology/IT

Description

SUMMARY

Join Brave’s mission to revolutionize web browsing through AI. We’re looking for an experienced ML Engineer to build next-generation features that serve nearly 100 million users worldwide. You’ll work with state-of-the-art language models, collaborating across teams to ship innovative AI capabilities that make the browser smarter and more capable—all while maintaining our privacy-first principles.

REQUIRED QUALIFICATIONS

2 to 5 years of experience optimizing and deploying ML models in production environments
Strong software engineering background with production experience
Extensive experience with PyTorch or other modern ML frameworks
Experience training custom models from scratch
Experience with model optimization and inference frameworks (e.g., vLLM, ONNX Runtime, Nvidia Triton)
Familiarity with MLOps practices & Kubernetes and ability to collaborate with DevOps teams on model monitoring, load testing, and CI/CD pipelines
Experience shipping ML-powered features or systems (consumer applications preferred)

PREFERRED QUALIFICATIONS

Master’s degree in Computer Science, Machine Learning, or related field
Familiarity with LLM serving frameworks (vLLM, TGI, Ray Serve) and GPU optimization
Experience with embeddings, vector databases, semantic search implementations, model training workflows, and data pipeline development
Experience integrating LLMs with tool calling/MCP
Knowledge of privacy-preserving ML techniques and on-device model deployment
Previous work on cost optimization and performance tuning of ML systems at scale

How To Apply:

Incase you would like to apply to this job directly from the source, please click here

Responsibilities

Evaluate, integrate, and deploy state-of-the-art language models for Leo and other browser AI capabilities, including both cloud-based and on-device deployment scenarios
Design, optimize, and maintain ML inference pipelines for browser-integrated AI features, with focus on reducing deployment costs and improving model performance
Develop and train custom ML models for browser-specific use cases such as content classification and search optimization using techniques like LoRA and DPO, including distributed training setups
Generate synthetic data for training data augmentation and model evaluation
Collaborate with browser engineering teams to seamlessly integrate AI capabilities into core product features while maintaining performance and privacy standards
Collaborate with product and design teams to define, prototype, and ship new AI-powered features including text-to-speech, image generation, and enhanced tool calling capabilities
Implement and optimize model serving infrastructure using frameworks like vLLM, ONNX Runtime, and Nvidia Triton to achieve production-scale performance requirements
Collaborate with DevOps teams on MLOps infrastructure including model monitoring, load testing, caching optimization, and automated CI/CD pipelines for model deployments
Contribute to privacy-preserving ML approaches and on-device model implementations that align with Brave’s privacy-first mission