Dezhi Yu

Dezhi Yu

Senior ML Engineer

I am a research-oriented machine learning systems engineer working on foundation model infrastructure, alignment, and evaluation. My work focuses on building efficient, reliable systems for large language models while studying the algorithms and data choices that make these models more useful, controllable, and cost-effective in real applications.

At TikTok, my recent work centers on Model-as-a-Service platforms and high-performance LLM inference. I develop serving infrastructure with vLLM and SGLang across model runtime integration, scheduling and continuous batching, KV-cache and memory management, distributed execution, observability, and reliability. This systems work is closely connected to my research on distributed disaggregated inference, preference optimization, instruction-tuning data selection, multimodal evaluation, and retrieval-augmented biomedical summarization.

My broader research spans reinforcement learning for robotics, healthcare sequence modeling, privacy-preserving machine learning, and motion planning. I am especially interested in model-system co-design: how model architecture, inference algorithms, data curation, hardware utilization, scheduling, and distributed runtimes interact. My goal is to advance frontier AI systems that are faster to experiment with, more rigorous to evaluate, and dependable enough to serve at scale.

Eleme Quarterly Newcomer Report featured image

Eleme Quarterly Newcomer Report

Introduce what achievements have been accomplished in the past three months at work.

avatar
Dezhi Yu
Weex layout engine powered by FlexBox algorithm featured image

Weex layout engine powered by FlexBox algorithm

In the last article, we talked about the basic process of Weex working on the iOS client. This article will analyze in detail how Weex lays out the native interface with high …

avatar
Dezhi Yu
iOS Master Book Spring featured image

iOS Master Book Spring

This book is not a systematic study course, but an advanced supplementary book that broadens your horizons, so that readers can access things that are not commonly used in their …

avatar
Dezhi Yu
iOS Architecture Smalltalk featured image

iOS Architecture Smalltalk

Briefly talk about several iOS infrastructures and their respective advantages and disadvantages.

avatar
Dezhi Yu
Phabricator Introduce featured image

Phabricator Introduce

Phabricator Use guide introduce —— How to use phabricator for code review.

avatar
Dezhi Yu