AI Lab - LLM Applied Evaluation and Benchmark Intern at CUMMINGS INC
Chaoyang District, Beijing, China
Full Time


Start Date

Immediate

Expiry Date

22 Jun, 26

Salary

0.0

Posted On

24 Mar, 26

Experience

0 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Large Language Model, Evaluation Frameworks, Benchmarking Datasets, Chatbot, ChatBI, AI Agent, Prompt Engineering, RAG, Agent Frameworks, Software Testing, Python, Data Processing, Data Privacy, SQL, Data Analysis, Logical Thinking

Industry

Motor Vehicle Manufacturing

Description
This position is not available in the GPP database. A Talent Acquisition team member will fill in the posting description after the intake meeting. Cummins is an equal opportunity employer. Our policy is to provide equal employment opportunities to all qualified persons without regard to race, sex, color, disability, national origin, age, religion, union affiliation, sexual orientation, veteran status, citizenship, gender identity, or other status protected by law.

How To Apply:

In case you would like to apply for this job directly from the source, please click here

Responsibilities
The role involves designing and executing evaluation frameworks and building benchmarking datasets and metrics for Large Language Model applications like Chatbot and ChatBI systems. Key tasks include testing AI Agent-based applications, analyzing model output quality, identifying issues like hallucinations, and documenting findings for stakeholders.