Dezhi Yu

Dezhi Yu

Senior ML Engineer

I am a research-oriented machine learning systems engineer working on foundation model infrastructure, alignment, and evaluation. My work focuses on building efficient, reliable systems for large language models while studying the algorithms and data choices that make these models more useful, controllable, and cost-effective in real applications.

At TikTok, my recent work centers on Model-as-a-Service platforms and high-performance LLM inference. I develop serving infrastructure with vLLM and SGLang across model runtime integration, scheduling and continuous batching, KV-cache and memory management, distributed execution, observability, and reliability. This systems work is closely connected to my research on distributed disaggregated inference, preference optimization, instruction-tuning data selection, multimodal evaluation, and retrieval-augmented biomedical summarization.

My broader research spans reinforcement learning for robotics, healthcare sequence modeling, privacy-preserving machine learning, and motion planning. I am especially interested in model-system co-design: how model architecture, inference algorithms, data curation, hardware utilization, scheduling, and distributed runtimes interact. My goal is to advance frontier AI systems that are faster to experiment with, more rigorous to evaluate, and dependable enough to serve at scale.

SQL Practical Optimization featured image

SQL Practical Optimization

Introduce several practical SQL optimization techniques.

avatar
Dezhi Yu
TLS 1.3 Handshake Protocol featured image

TLS 1.3 Handshake Protocol

The handshake protocol is used to negotiate the security parameters of the connection. The handshake messages are provided to the TLS recording layer, where they are encapsulated …

avatar
Dezhi Yu
How to understand gradient descent? featured image

How to understand gradient descent?

Gradient descent is a first-order iterative optimization algorithm for finding a local minimum of a differentiable function. To find a local minimum of a function using gradient …

avatar
Dezhi Yu
Fundamentals of Cryptography featured image

Fundamentals of Cryptography

Introduce some basic knowledge of Cryptography.

avatar
Dezhi Yu
The Practice of Spatial Index in Geographic Service featured image

The Practice of Spatial Index in Geographic Service

I share my practice 《Application of spatial index in geographic service》. The contents are as follows: - How to understand n-dimensional space and n-dimensional space-time - …

avatar
Dezhi Yu
Getting started with Machine Learning featured image

Getting started with Machine Learning

Introduce some basic knowledge of machine learning.

avatar
Dezhi Yu
How to design and implement a thread-safe Map data structure featured image

How to design and implement a thread-safe Map data structure

Map is a very common data structure used to store some unordered key-value pairs. In mainstream programming languages, it comes with its implementation by default. STL in C and C++ …

avatar
Dezhi Yu
iOS Master Book WWDC 2017 featured image

iOS Master Book WWDC 2017

This book is not a systematic study course, but an advanced supplementary book that broadens your horizons, so that readers can access things that are not commonly used in their …

avatar
Dezhi Yu
Functional Reactive Programming Thinking featured image

Functional Reactive Programming Thinking

Introduce Functional Reactive Programming, and Functional Reactive Programming impact on programming thinking.

avatar
Dezhi Yu
iOS Master Book Summer featured image

iOS Master Book Summer

This book is not a systematic study course, but an advanced supplementary book that broadens your horizons, so that readers can access things that are not commonly used in their …

avatar
Dezhi Yu