/ L'annuaire des offres d'emploi en Suisse Romande
n/a n/a Grand-Lancy CH
full-time

Fullstack LLM Application Engineer

Entreprise
Experis
Lieu
Grand-Lancy
Date de publication
14.08.2025
Référence
4947523

Description

Experis SA - Your IT Partner in Switzerland, , Experis SA, the Swiss branch of the ManpowerGroup, is a leader in recruiting specialized IT profiles. We offer flexible solutions: temporary assignments, permanent placements, project management, and professional training. Our expertise covers digital transformation, cybersecurity, cloud, infrastructure, and more. Thanks to our global reach combined with a pragmatic and personalized approach, we effectively support both professionals and companies in achieving their goals.Fullstack LLM Application EngineerExperis SA - Your IT Partner in Switzerland
Experis SA, the Swiss branch of the ManpowerGroup, is a leader in recruiting specialized IT profiles. We offer flexible solutions: temporary assignments, permanent placements, project management, and professional training. Our expertise covers digital transformation, cybersecurity, cloud, infrastructure, and more. Thanks to our global reach combined with a pragmatic and personalized approach, we effectively support both professionals and companies in achieving their goals.

Title: LLM Application Engineer (Full-Stack) - AI Chatbots for Financial Environments
Location : Geneva, Switzerland - Hybrid Work Model
Type : Permanent
Languages : Fluent English (C1+), other languages an asset
About the Role
We are looking for an experienced Full-Stack Engineer to design, build, and deploy AI-powered chatbot applications serving mission-critical functions across trading, middle office, and back office environments. You will take end-to-end ownership of the application stack, from UI to secure backend APIs and integrations with existing enterprise systems, collaborating closely with data science and infrastructure teams.
Key Responsibilities

  • Chat UI Development: Create responsive React/TypeScript interfaces with SSO, semantic search, and chat patterns.
  • API & Model Serving: Build FastAPI REST/gRPC endpoints, deploy LLMs using Triton, Ray Serve, or AWS SageMaker with GPU-aware autoscaling.
  • Security: Implement OAuth2/OIDC, prompt validation, rate limits, audit logging, and row-level entitlements.
  • Integration: Connect with Java/.NET systems, FIX, Kafka, and message queues with low latency.
  • Monitoring & Deployment: Configure observability dashboards (Prometheus/Grafana), manage GitLab CI/CD, perform blue-green deployments.
  • Infrastructure Automation: Contribute Terraform modules for EKS, API Gateways, Transit Gateways, and Lambda functions.

Technical Skills Required

  • Python , FastAPI, LangChain/Haystack, React, TypeScript
  • Hugging Face, AWS Bedrock SDK, Triton Inference Server, Ray Serve, AWS SageMaker
  • OAuth2, OIDC, SSO
  • SQL, pgvector, Pinecone, OpenSearch
  • Docker, Kubernetes/EKS, Terraform, GitLab CI
  • REST, gRPC, WebSockets, SSE

Profile

  • 4+ years in developing cloud-native, user-facing applications (finance/trading preferred).
  • Proven deployment of LLMs or deep learning models in production.
  • Strong presentation and stakeholder communication skills.
  • Ability to thrive in a high-speed environment while maintaining enterprise-grade reliability.

Interested ?
Don't hesitate to apply ! We are looking forward to see your profile and experiences. jidcf017bbafr jit0833afr

Postuler