Fullstack LLM Application Engineer

Entreprise: Experis
Lieu: Grand-Lancy
Date de publication: 14.08.2025
Référence: 4947523

Description

Experis SA - Your IT Partner in Switzerland, , Experis SA, the Swiss branch of the ManpowerGroup, is a leader in recruiting specialized IT profiles. We offer flexible solutions: temporary assignments, permanent placements, project management, and professional training. Our expertise covers digital transformation, cybersecurity, cloud, infrastructure, and more. Thanks to our global reach combined with a pragmatic and personalized approach, we effectively support both professionals and companies in achieving their goals.Fullstack LLM Application EngineerExperis SA - Your IT Partner in Switzerland
Experis SA, the Swiss branch of the ManpowerGroup, is a leader in recruiting specialized IT profiles. We offer flexible solutions: temporary assignments, permanent placements, project management, and professional training. Our expertise covers digital transformation, cybersecurity, cloud, infrastructure, and more. Thanks to our global reach combined with a pragmatic and personalized approach, we effectively support both professionals and companies in achieving their goals.

Title: LLM Application Engineer (Full-Stack) - AI Chatbots for Financial Environments
Location : Geneva, Switzerland - Hybrid Work Model
Type : Permanent
Languages : Fluent English (C1+), other languages an asset
About the Role
We are looking for an experienced Full-Stack Engineer to design, build, and deploy AI-powered chatbot applications serving mission-critical functions across trading, middle office, and back office environments. You will take end-to-end ownership of the application stack, from UI to secure backend APIs and integrations with existing enterprise systems, collaborating closely with data science and infrastructure teams.
Key Responsibilities

Chat UI Development: Create responsive React/TypeScript interfaces with SSO, semantic search, and chat patterns.
API & Model Serving: Build FastAPI REST/gRPC endpoints, deploy LLMs using Triton, Ray Serve, or AWS SageMaker with GPU-aware autoscaling.
Security: Implement OAuth2/OIDC, prompt validation, rate limits, audit logging, and row-level entitlements.
Integration: Connect with Java/.NET systems, FIX, Kafka, and message queues with low latency.
Monitoring & Deployment: Configure observability dashboards (Prometheus/Grafana), manage GitLab CI/CD, perform blue-green deployments.
Infrastructure Automation: Contribute Terraform modules for EKS, API Gateways, Transit Gateways, and Lambda functions.

Technical Skills Required

Python , FastAPI, LangChain/Haystack, React, TypeScript
Hugging Face, AWS Bedrock SDK, Triton Inference Server, Ray Serve, AWS SageMaker
OAuth2, OIDC, SSO
SQL, pgvector, Pinecone, OpenSearch
Docker, Kubernetes/EKS, Terraform, GitLab CI
REST, gRPC, WebSockets, SSE

Profile

4+ years in developing cloud-native, user-facing applications (finance/trading preferred).
Proven deployment of LLMs or deep learning models in production.
Strong presentation and stakeholder communication skills.
Ability to thrive in a high-speed environment while maintaining enterprise-grade reliability.

Interested ?
Don't hesitate to apply ! We are looking forward to see your profile and experiences. jidcf017bbafr jit0833afr

Postuler

Fullstack LLM Application Engineer

Description

job-too.ch