GPU Engineer

Compagnie : Kog & Company

Type d'emploi : À plein temps

France

Nombre de candidats

000+

Envoyer une candidature maintenant

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications

Activate JobCopilot

Description de l'emploi - GPU Engineer

About Kog

Kog builds the fastest LLM inference engine on standard datacenter GPUs. Our Kog Inference Engine generates 3,000 output tokens per second per request on a single 8× AMD MI300X node and 2,100 on an 8× NVIDIA H200 node (FP16, batch size 1, no speculative decoding).

We co-design the model architecture and the execution engine together. Our Laneformer model uses Delayed Tensor Parallelism (DTP), a novel architecture that restructures the Transformer dependency graph so inter-GPU communication overlaps with computation rather than blocking it.

We pretrained a 2B-parameter DTP model on 6T tokens on 256 H100 GPUs.

We are a team of 11 people, including 10 engineers and 4 PhDs.

Test it at playground.kog.ai. Read the technical details on the Kog Labs blog.

What you will work on

You will run experiments to understand how architectural decisions propagate through training dynamics and inference behavior, propose and test variants, and turn findings into measurable gains in model quality and generation speed.

Design new model architectures, including routing strategies, attention mechanisms, and MoE structure, with execution constraints as a first-order design input.
Extend the Laneformer thesis by exploring inference-aware architectural variants such as DTP, Ladder Residual, and PT-Transformer, and finding what compounds at scale.
Own the training pipeline across convergence stability, distributed training efficiency, data pipeline design, and evaluation methodology.
Scale the stack to large MoE models such as DeepSeek v4 and Qwen 3, working through routing, expert parallelism, and communication patterns at inference time.
Contribute to building AI agents that will perform architecture research and training experiments autonomously, starting from the research foundations we are building now.

What we look for

You have trained large models from scratch and understand training dynamics at the level where you can diagnose issues from first principles.

Showing the work is a requirement to move forward in the process. This means a paper, a repository, or a codebase with concrete evidence of what you built and what it produced.

Strong signals include experience with architecture-hardware co-design or inference-aware training, fluency in Transformers and MoE with enough depth to reason across trade-offs, and production-grade training code in PyTorch or JAX.

Post-training work such as speculative decoding, quantization-aware training, or preference optimization is an acceptable starting point if it demonstrates a genuine understanding of how those methods affect model behavior and deployment.

You will spend at least 50% of your time in our Paris office.

What we offer

Direct access to AMD and NVIDIA datacenter GPUs from day one
A team where creativity and technical judgment carry weight and where the people closest to the problem shape the key decisions
Problems that sit on the critical path of model execution speed and that directly influence what the system can become
A remote-friendly working model, though you'll spend at least 50% of your time in our Paris office.
Compensation aligned with top technical profiles in the Paris GPU Engineering market, including equity

Original job GPU Engineer posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.

Envoyer une candidature maintenant

Auto-Apply to Similar Jobs

Share Job

Get your Resume Reviewed for Free

Automate Job Applications for Similar Jobs

Auto-Apply to GPU Engineer Jobs with your AI JobCopilot

Auto-Apply with AI

Similar GPU Engineer Jobs in France

Get your Resume Reviewed for Free

Adresse e-mail

Pourquoi signalez-vous ce travail?

I think it’s a discriminatory or offensive

I think it’s fraudulent or a scam

I think it’s trying to sell something unrelated to the job / it’s asking for money

I think it contains incorrect or broken information

Other

Toutes les offres d'emploi sont soumises aux Terms of Service de GrabJobs. Nous permettons aux utilisateurs de signaler les annonces qui pourraient enfreindre ces conditions. Les offres d'emploi peuvent également être signalées par l'équipe de modération de GrabJobs. Cependant, aucun système de modération n'est parfait, et le fait de signaler une annonce ne garantit pas qu'elle sera supprimée.

Setup your job alert:

Frequency

E-mail

En activant les alertes d'emploi, j'accepte les Terms et Privacy Policy de GrabJobs. Je peux me désabonner des alertes d'emploi à tout moment. Sauter