Lead AI Engineer (LLMs & Data Pipelines)

We are seeking a Lead AI Engineer (LLMs & Data Pipelines) to drive the design, integration, and operational excellence of LLM-powered capabilities across our platforms.

In this role, you will build intelligent features such as classification, extraction, summarization, and action orchestration powered by large language models. You will design embedding and retrieval pipelines (RAG, semantic search), create robust data pipelines for training and evaluation, and define clear evaluation metrics and quality gates to ensure reliable LLM behavior in production.

You will work hands-on with inference runtimes such as ONNX Runtime and TensorFlow Lite, benchmarking performance across CPU, GPU, NPU, and DSP environments, and optimizing deployments for latency, cost, and reliability—including in constrained or embedded systems. Collaborating with engineering, data, and MLOps teams, you will integrate models into real-world APIs and production systems while continuously experimenting with prompts, architectures, and model choices.

If you are passionate about turning advanced AI research into scalable, production-ready systems and enjoy balancing performance, accuracy, and operational constraints, this may be your next mission.

May your next career adventure begin here.

Overall responsibilities and duties:

Build and integrate LLM-powered features (classification, extraction, summarization, actions).

Integrate models with inference runtimes (such as ONNX Runtime, TensorFlow Lite / LiteRT).

Benchmark and validate model performance across different hardware backends (CPU, GPU, NPU, DSP).

Design embedding and retrieval pipelines (RAG, semantic search).

Create and maintain data pipelines for training and evaluation.

Define evaluation metrics and quality gates for LLM behavior.

Optimize inference for latency, cost, and reliability.

Integrate models into production systems and APIs.

Run experiments to evaluate prompts, models, and architectures.

Qualifications:

Strong experience with LLMs and NLP systems

Hands-on experience with embeddings and vector databases

Strong Python skills and ML frameworks

Experience building production data pipelines

Solid understanding of evaluation and regression detection

Experience with RAG architectures

MLOps or monitoring experience

Experience with model calibration and accuracy/latency trade-off analysis.

Hands on experience deploying models on edge or embedded devices (constrained environments)

What you’ll enjoy at Appning:

Work From Home Allowance to support your home office setup and comfort.
Comprehensive Health Insurance for every employee.
Generous Time Off – Extra days off based on your years with us.
A Day Off on Your Birthday to celebrate your way.
Employee Workplace Program (EWP) designed to support your wellbeing and growth.
Flexible Working Hours to help you balance work and life.
Free Breakfast and Snacks at the office.
Exclusive Platform Discounts through our Inspiring Benefits Program.
Referral Program – Bring great people and get rewarded.
Amazing Team Gatherings that you’ll actually look forward to.

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Bauer Media first broadcaster to launch their own car app with Radioplayer and Appning

We took a tour of the future of in-car entertainment and connectivity | The Verge

On the Road to the Future: Exploring In-Car Streaming Entertainment

The case for Android Automotive OS

Powering the connected car: Faurecia Aptoide Automotive App Store

Incorporating Driver Distraction Guidelines into AAOS Apps