Start Date
Immediate
Expiry Date
12 Sep, 26
Salary
0.0
Posted On
14 Jun, 26
Experience
2 year(s) or above
Remote Job
Yes
Telecommute
Yes
Sponsor Visa
No
Skills
LLM Inference, vLLM, SGLang, Python, C++, KV Cache Management, Distributed Inference, Tensor Parallelism, Pipeline Parallelism, Performance Profiling, TensorRT-LLM, Transformer Architecture, Speculative Decoding, Multi-modal Pipelines, Continuous Batching, Scheduling
Industry
Software Development