Taco | Dezhi Yu

Taco

Taco is one of the fast golang-based message push systems in the company, and have serviced 300 million user. Taco provides multiple push methods such as pushing a single user, pushing a group of users in batches, and accurately pushing specific user tags. The low-latency feature greatly empowers the logistics rider’s message reach scenario.

Taco is an Infura-like, API gateway on top of Golang backend services, MySQL and Redis, RabbitMQ and Kafka as the messaging queue, Hive, Blink and Elasticsearch as data statistics and message pipeline query, gRPC, Apache Thrift and HTTP as the communication protocol, which gRPC and Apache Thrift is for internal communication protocol, HTTP is for iOS/Android/H5 external communication protocol.

Last updated on Jun 9, 2026

Back-End Other

Authors

Dezhi Yu

Senior ML Engineer

I am a research-oriented machine learning systems engineer working on foundation model infrastructure, closed-loop evaluation and optimization systems, and scalable AI platforms. My work focuses on building reliable Model-as-a-Service and Harness-as-a-Service platforms that connect data, training, inference, evaluation, and feedback loops into measurable, continuously improving AI products.

My recent work centers on Model-as-a-Service platforms and high-performance LLM inference. I develop serving infrastructure with vLLM and SGLang across model runtime integration, scheduling and continuous batching, KV-cache and memory management, distributed execution, observability, and reliability. This systems work is closely connected to my research on distributed disaggregated inference, preference optimization, instruction-tuning data selection, multimodal evaluation.

My broader research centers on reinforcement learning infrastructure and reinforcement learning optimization algorithms for scalable AI systems. I am interested in how policy optimization, reward modeling, preference learning, offline RL, simulation environments, distributed rollout systems, and automated evaluation harnesses can be engineered together to improve model behavior. My goal is to build frontier AI systems that learn from feedback efficiently, evaluate progress rigorously, and remain dependable when deployed at scale.

← Prometheus May 20, 2020

threes-ai Feb 27, 2018 →