Fullstack LLM Application Engineer
- Entreprise
- Experis
- Lieu
- Grand-Lancy
- Date de publication
- 14.08.2025
- Référence
- 4947523
Description
Experis SA - Your IT Partner in Switzerland, , Experis SA, the Swiss branch of the ManpowerGroup, is a leader in recruiting specialized IT profiles. We offer flexible solutions: temporary assignments, permanent placements, project management, and professional training. Our expertise covers digital transformation, cybersecurity, cloud, infrastructure, and more. Thanks to our global reach combined with a pragmatic and personalized approach, we effectively support both professionals and companies in achieving their goals.Fullstack LLM Application EngineerExperis SA - Your IT Partner in Switzerland
Experis SA, the Swiss branch of the ManpowerGroup, is a leader in recruiting specialized IT profiles. We offer flexible solutions: temporary assignments, permanent placements, project management, and professional training. Our expertise covers digital transformation, cybersecurity, cloud, infrastructure, and more. Thanks to our global reach combined with a pragmatic and personalized approach, we effectively support both professionals and companies in achieving their goals.
Title: LLM Application Engineer (Full-Stack) - AI Chatbots for Financial Environments
Location : Geneva, Switzerland - Hybrid Work Model
Type : Permanent
Languages : Fluent English (C1+), other languages an asset
About the Role
We are looking for an experienced Full-Stack Engineer to design, build, and deploy AI-powered chatbot applications serving mission-critical functions across trading, middle office, and back office environments. You will take end-to-end ownership of the application stack, from UI to secure backend APIs and integrations with existing enterprise systems, collaborating closely with data science and infrastructure teams.
Key Responsibilities
- Chat UI Development: Create responsive React/TypeScript interfaces with SSO, semantic search, and chat patterns.
- API & Model Serving: Build FastAPI REST/gRPC endpoints, deploy LLMs using Triton, Ray Serve, or AWS SageMaker with GPU-aware autoscaling.
- Security: Implement OAuth2/OIDC, prompt validation, rate limits, audit logging, and row-level entitlements.
- Integration: Connect with Java/.NET systems, FIX, Kafka, and message queues with low latency.
- Monitoring & Deployment: Configure observability dashboards (Prometheus/Grafana), manage GitLab CI/CD, perform blue-green deployments.
- Infrastructure Automation: Contribute Terraform modules for EKS, API Gateways, Transit Gateways, and Lambda functions.
Technical Skills Required
- Python , FastAPI, LangChain/Haystack, React, TypeScript
- Hugging Face, AWS Bedrock SDK, Triton Inference Server, Ray Serve, AWS SageMaker
- OAuth2, OIDC, SSO
- SQL, pgvector, Pinecone, OpenSearch
- Docker, Kubernetes/EKS, Terraform, GitLab CI
- REST, gRPC, WebSockets, SSE
Profile
- 4+ years in developing cloud-native, user-facing applications (finance/trading preferred).
- Proven deployment of LLMs or deep learning models in production.
- Strong presentation and stakeholder communication skills.
- Ability to thrive in a high-speed environment while maintaining enterprise-grade reliability.
Interested ?
Don't hesitate to apply ! We are looking forward to see your profile and experiences. jidcf017bbafr jit0833afr