Hui Sun
Nanjing University
LAMDA Group

Biography

I am currently a Ph.D. student at the School of Artificial Intelligence, Nanjing University, and a member of the LAMDA Group, which is led by Prof. Zhi-Hua Zhou (周志华). I am supervised by Prof. Ming Li (黎铭).

I received my B.Sc. from Jilin University in 2019, and my M.Sc. from Nanjing University in 2022. Before starting my Ph.D., I worked as a full-time algorithm engineer at Shopee for over one year, and then returned to Nanjing University for full-time Ph.D. study.

My research interests include large language models (LLMs), machine learning, and information retrieval, with related industry experience at Shopee, ByteDance, and Alibaba.

Sep 2015 - Jun 2019 | Jilin University | B.Sc. in SE
Sep 2019 - Jun 2022 | Nanjing University | M.Sc. in CS
Sep 2023 - Present | Nanjing University | Ph.D. in CS

Oct 2018 - Mar 2019 | ByteDance | Intern | Search Algorithm
Jun 2021 - Sep 2021 | AliExpress | Intern | Recommendation Algorithm
Jul 2022 - Aug 2023 | Shopee | Full-time | Search Algorithm
Sep 2023 - Jun 2025 | Alibaba | Intern | Multimodal LLM Research
Jul 2025 - Mar 2026 | TikTok | Intern (Soaring Star Talent Program, 筋斗云) | LLM Research for Recommendation
Mar 2026 - Present | Alibaba Cloud | Intern | LLM Research

Research Interests

My recent research focuses on the following areas:

Machine Learning, with a focus on Transfer Learning

Publications

Preprints & Under Review

P1
Tech Report

Ovis2.5 Technical Report

Ovis Team (Contributor: Hui Sun), Alibaba Group

2025. Technical Report, arXiv: 2508.11737

P2
Preprint

ACES: Who Tests the Tests? Leave-One-Out AUC Consistency for Code Generation

Hui Sun†, Yun-Ji Zhang† (†Equal Contribution), Zheng Xie, Ren-Biao Liu, Yali Du, Xin-Ye Li, and Ming Li

2026. Preprint, arXiv: 2604.03922

P3
Preprint

Design-Specification Tiling for ICL-based CAD Code Generation

Yali Du, Sanzhuo Xi, Hui Sun, and Ming Li

2026. Preprint, arXiv: 2603.12712

P4
Under Review

Xin-Ye Li, Ren-Biao Liu, Yun-Ji Zhang, Hui Sun, Zheng Xie, and Ming Li

P5
Under Review

Ren-Biao Liu, Xin-Ye Li, Hui Sun, Yali Du, Jiang-Tian Xue, and Ming Li

P6
Under Review

Xin-Ye Li, Yali Du, Hui Sun, and Ming Li

Journal Papers

J1
TPAMI 2026 CCF-A SCI

Mitigating Negative Transfer via Reducing Environmental Disagreement

Hui Sun, Zheng Xie, Hao-Yuan He, and Ming Li

IEEE Transactions on Pattern Analysis and Machine Intelligence, 2026

J2
TSE 2025 CCF-A SCI | FSE 2026 CCF-A

Post-Incorporating Code Structural Knowledge into Pretrained Models via ICL for Code Translation

Yali Du†, Hui Sun† (†Equal Contribution), and Ming Li

IEEE Transactions on Software Engineering, 2025

This work has been selected for presentation at the ACM International Conference on the Foundations of Software Engineering (FSE), 2026 through the Journal First Track, which invites recently published high-quality journal articles to be presented at the conference.

J3
TORS 2024

Learning Personalizable Clustered Embedding for Recommender Systems

Yizhou Chen, Guangda Huzhang, Anxiang Zeng, Qingtao Yu, Hui Sun, Heng-Yi Li, Jingyi Li, Yabo Ni, Han Yu, and Zhiming Zhou

ACM Transactions on Recommender Systems, 2024

J4
SCIS 2023 CCF-A SCI

Enhancing Unsupervised Domain Adaptation by Exploiting the Conceptual Consistency of Multiple Self-supervised Tasks

Hui Sun and Ming Li

SCIENCE CHINA Information Sciences, 2023, 66: 142101

Conference Papers

C1
WWW 2026 CCF-A

OneTrans: Unified Feature Interaction and Sequence Modeling with One Transformer in Industrial Recommender

Zhaoqi Zhang, Haolei Pei, Jun Guo, Tianyu Wang, Yufei Feng, Hui Sun, Shaowei Liu, and Aixin Sun

The ACM Web Conference, 2026

C2
AAAI 2026 CCF-A

Dynamic-Static Synergistic Selection Method for Candidate Code Solutions with Generated Test Cases

Ren-Biao Liu, Jiang-Tian Xue, Chao-Zeng Ma, Hui Sun, Xin-Ye Li, and Ming Li

The 40th AAAI Conference on Artificial Intelligence, 2026

C3
AAAI 2026 CCF-A

ARBench: Algorithmic Reasoner or API Alchemist? Evaluating Code-Generating LLMs beyond API Calls

Ren-Biao Liu, Chao-Zeng Ma, An-Qi Li, Hui Sun, Xin-Ye Li, and Ming Li

The 40th AAAI Conference on Artificial Intelligence, 2026

C4
ICCV 2025 CCF-A

MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs

Hui Sun, Shiyin Lu, Huanyu Wang, Qing-Guo Chen, Zhao Xu, Weihua Luo, Kaifu Zhang, and Ming Li

IEEE/CVF International Conference on Computer Vision, 2025

C5
ICML 2025 CCF-A

Revisiting Chain-of-Thought in Code Generation: Do Language Models Need to Learn Reasoning before Coding?

Ren-Biao Liu, An-Qi Li, Chao-Ding Yang, Hui Sun, and Ming Li

The 42nd International Conference on Machine Learning, 2025

C6
ASE 2024 CCF-A

A Joint Learning Model with Variational Interaction for Multilingual Program Translation

Yali Du, Hui Sun, and Ming Li

The 39th IEEE/ACM International Conference on Automated Software Engineering, 2024

C7
ICML 2024 CCF-A

Ambiguity-Aware Abductive Learning

Hao-Yuan He, Hui Sun, Zheng Xie, and Ming Li

The 41st International Conference on Machine Learning, 2024

C8
AAAI 2023 CCF-A

Cooperative and Adversarial Learning: Co-Enhancing Discriminability and Transferability in Domain Adaptation

Hui Sun, Zheng Xie, Xin-Ye Li, and Ming Li

The 37th AAAI Conference on Artificial Intelligence, 2023

C9
AAAI 2023 CCF-A

Semi-Supervised Learning with Support Isolation by Small-Paced Self-Training

Zheng Xie, Hui Sun, and Ming Li

The 37th AAAI Conference on Artificial Intelligence, 2023

C10
WWW 2023 CCF-A

Clustered Embedding Learning for Recommender Systems

Yizhou Chen, Guangda Huzhang, Anxiang Zeng, Qingtao Yu, Hui Sun, Heng-Yi Li, Jingyi Li, Yabo Ni, Han Yu, and Zhiming Zhou

The ACM Web Conference, 2023

Patents

PT1
Granted

An Unsupervised Transfer Learning Image Classification Algorithm for Environmental Changes (一种面向环境变化的无监督迁移学习图像分类算法)

Ming Li (黎铭), Hui Sun (孙辉), Zhi-Hua Zhou (周志华)

Patent No. 202210461879.4

Awards & Honors

ByteDance Soaring Star Talent Program

筋斗云人才计划

ByteDance · 2025

Value Star Awards

Top 4% (8 out of 200+)

Shopee · Dec 2022

Artificial Intelligence Scholarship

50 recipients across 9 AI-related schools

Nanjing University · 2019

China Collegiate Computing Contest (CCCC)

Outstanding Winner (Highest Honor)

Jilin Province · Mar 2018

ACM-ICPC Asia Regional Contest

Silver Medal

Nanning · Dec 2017

ACM-ICPC Asia EC-Final

Bronze Medal

Shanghai · Dec 2017

Northeast Collegiate Programming Contest

First Prize

Changchun · May 2017

Jilin Province Collegiate Programming Contest

First Prize

Jilin Province · 2017

Work Experience

I have 4.5+ years of industry experience in algorithm R&D across recommendation systems and multimodal foundation models, with work at ByteDance, Alibaba, and Shopee. My experience focuses on building and iterating production systems for ranking, retrieval, and large-scale model applications, with an emphasis on measurable online impact.

ByteDance - TikTok Shop Rec.

Recommendation LLM Algorithm Intern | Soaring Star Talent Program (筋斗云人才计划)

Jul 2025 - Mar 2026

Focused on scaling up TikTok Shop recommendation models for both fine-ranking and retrieval. Core work included unifying the fine-ranking architecture with a decoder-only transformer and independently designing and deploying a generative recall solution that delivered clear offline and online gains.

Fine-Ranking - OneTrans Unified Modeling: Core contributor to the design and deployment of OneTrans, which replaced traditional feature crossing and sequence modules with a unified decoder-only transformer for TikTok Shop fine-ranking, bringing significant online gains across multiple core scenarios.

+1.36% | GMV/u | E-commerce Overall
+1.35% | GMV/u | TikTok Shop Overall
+1.77% | GMV/u | Mall Overall
+2.47% | GMV/u | Mall Feeds
WWW'26 | OneTrans | Co-author

Recall - Generative Retrieval: Served as the person in charge (PIC), designing, implementing, and launching a generative recall solution end to end, with substantial improvements in offline HitRate and strong online A/B gains in CVR, CTCVR, and GPM.

+2.85% | HitRate@100 | Offline Metric
+101.83% | PV CVR | A/B Testing
+45.59% | PV CTCVR | A/B Testing
+67.66% | GPM | A/B Testing

Academic Impact: The OneTrans work was published at WWW 2026 and received broad recognition from industry peers including Meta.

Alibaba AIDC - Ovis Team

Multimodal LLM Research Intern | Ovis Team

Sep 2023 - Jun 2025

Early core contributor to the Ovis series, deeply involved in the iteration from V1.6 to V2.5. Led the build-out of multi-image and video understanding capabilities from scratch, proposed the MDP3 method for stronger multi-image and long-video understanding, and contributed to GUI Agent SFT and web trajectory data automation.

Top #1 | Ovis 2.5 | <40B
Top #2 | Ovis 2.0 | OpenCompass
ICCV'25 | MDP3 | First-author
Tech Report | Ovis2.5 | Contributor

Open-Source and Community Impact: The Ovis series was open-sourced on Hugging Face and repeatedly covered by Hugging Face and QbitAI.

4.7M+ | Downloads | HuggingFace
2k+ | Likes | HuggingFace
1.4k+ | GitHub Stars | Ovis Repo

Academic Impact: Related work was published at ICCV 2025 as first author, and I also contributed to the Ovis 2.5 technical report.

Shopee (Sea Limited)

Core Search Algorithm Engineer (Full-time) | E-commerce Search Ranking

Jul 2022 - Aug 2023

Led the refactoring of Shopee's core search fine-ranking training and serving framework, maintained online models across all sites, and helped extend the refactored design to recall, coarse-ranking, and long-tail teams through modular configuration, pretrained parameter reuse, and user-side computation reuse.

Architecture - Fine-Ranking System Refactor: Rebuilt the production framework to improve convergence efficiency, offline training speed, and online inference latency, while making the solution reusable across multiple ranking stages.

2m → 1wk | Convergence Data | Module-wise Reuse
+105% | Offline Training | 37 → 76 samples/(c·s)
+232% | Online Inference | 95.5 ms → 28.8 ms
10x | CEL Runtime | Real-world Deployment

Algorithm - Multi-task and Long-tail Optimization: Introduced PLE for CTR/CVR multi-task learning, AutoDis for discretizing numerical features, and CEL for better long-tail representation learning.

+0.5% | PLE | CTR AUC
+0.2%/+0.3% | AutoDis | CTR/CVR AUC
+0.6% | CEL | CTR AUC
WWW'23 | CEL [C] | Co-author
TORS'24 | CEL [J] | Co-author

Academic Impact: The CEL work was later published at WWW 2023 and ACM TORS 2024.

AliExpress (Alibaba Group)

Recommendation Algorithm Intern | Fine-Ranking

Jun 2021 - Sep 2021

Worked on multi-objective fine-ranking and country-specific modeling for AliExpress recommendation. Reproduced 14 frontier recommendation papers and completed algorithmic improvements that significantly lifted both CTR and L2P performance in AUC and GAUC.

+0.50% | CTR | AUC
+0.53% | L2P | AUC
+0.78% | CTR | GAUC
+1.17% | L2P | GAUC

ByteDance

Search Algorithm Intern | Vertical Search End-to-End

Oct 2018 - Mar 2019

Worked on end-to-end engineering and algorithm optimization for account vertical search and general search cards in Toutiao apps. Rewrote the Elasticsearch-based recall pipeline and introduced the first GBDT ranking model for Toutiao account search, leading to clear online metric gains.

+30% | Query CTR | Vertical Search
+60% | Recall Rate | General Search
+5% | Top 3 CTR | General Search

Education

Nanjing University | Sep 2023 - Present (Expected Jun 2027)

Ph.D. in Computer Science and Technology, School of Artificial Intelligence

LAMDA Group · Supervisor: Prof. Ming Li (黎铭)

Nanjing University | Sep 2019 - Jun 2022

M.Sc. in Computer Science and Technology, School of Artificial Intelligence

Recommended admission to the LAMDA Group (without entrance examination); ranked 1st in the interview coding test.

Jilin University | Sep 2015 - Jun 2019

B.Sc. in Software Engineering (Excellent Engineer Program)

GPA: 3.7/4.0 (Top 5%); ITMO University Exchange (2017)

© 2025 Hui Sun (孙辉). Last updated: November 2025