UNDERGRADUATE RESEARCHER · UW–MADISON · BIOMEDICAL MULTIMODAL AI
本科科研 · 威斯康星大学麦迪逊分校 · 生物医学多模态AI
I study reliable biomedical multimodal AI, with a focus on evidence-grounded generation, uncertainty-aware evaluation, and latent reasoning. My current work includes MARIE, a 3D CT report-generation framework under review at ACM Multimedia 2026, and a first-author ACL ARR / EMNLP 2026 submission on uncertainty quantification for LLM-based systems. I am also a Summer Research Assistant at Tsinghua University, working on latent reasoning for diffusion language models.
我研究可靠的生物医学多模态AI,关注证据约束生成、不确定性评估与latent reasoning。当前工作包括MARIE:一项面向3D CT报告生成的证据token定位框架,正在 ACM Multimedia 2026 审稿;以及一篇第一作者 ACL ARR / EMNLP 2026 投稿,研究LLM系统的不确定性量化。我目前也在清华大学担任暑期科研助理,研究latent reasoning如何提升diffusion language model的性能。
MARIE routes sentence-level 3D CT evidence tokens, exposes only the committed subset through an Evidence Dictionary, and audits laterality/anatomy/depth consistency. → ACM Multimedia 2026 under review; rebuttal completed
MARIE 对3D CT进行句子级证据token路由,通过 Evidence Dictionary 约束生成,并审计左右侧、解剖区域与深度一致性。→ ACM Multimedia 2026 审稿中;rebuttal已结束
First-author ACL ARR / EMNLP 2026 submission benchmarking uncertainty quantification over 45,163 fixed LLM ranking records, with Error@10 detection and paired source-response diagnosis.
第一作者 ACL ARR / EMNLP 2026 投稿:在45,163条固定LLM排序记录上评估UQ信号,包含 Error@10 检测与用户/物品来源诊断。
At Tsinghua University, I am developing a proposal-stage framework that trains masked latent token slots as reasoning states and allocates latent compute according to uncertainty.
在清华大学暑研中,我正在推进一个proposal-stage框架:将masked latent token训练为推理状态,并基于不确定性分配latent compute。
My long-term direction is to build models that ground generated actions, reports, and decisions in explicit perceptual evidence, spanning medical AI agents, embodied AI, and autonomous systems.
长期方向是让模型把生成的动作、报告和决策锚定到明确感知证据上,覆盖医疗AI Agent、具身AI与自动系统。
Developing a proposal-stage framework, Reasoning-Aware Latent Diffusion (RALD), that trains masked latent token slots as reasoning states and adaptively allocates latent compute based on uncertainty. Targeting reasoning benchmarks such as Sudoku, Zebra, Countdown, GSM8K, and MATH500, with possible extensions to medical VQA and multimodal medical reasoning.
正在推进proposal-stage框架 Reasoning-Aware Latent Diffusion (RALD):将masked latent token训练为推理状态,并依据不确定性自适应分配latent compute。计划评估Sudoku、Zebra、Countdown、GSM8K、MATH500,并探索医学VQA与多模态医学推理扩展。
Reliable and verifiable AI generation research. Key outputs: MARIE (ACM MM 2026, co-first author, under review; rebuttal completed), a first-author ACL ARR / EMNLP 2026 submission on uncertainty quantification for LLM-based recommendation, and GOING (TIST, first author, under review). Focus on evidence-committed generation, source-aware reliability evaluation, and training-free RAG robustness.
可靠且可验证的AI生成研究。核心成果:MARIE(ACM MM 2026,共同第一作者,审稿中;rebuttal已结束)、一篇第一作者ACL ARR / EMNLP 2026不确定性量化投稿,以及GOING(TIST,第一作者,审稿中)。研究重点包括证据承诺生成、来源感知可靠性评估与免训练RAG鲁棒性。
Built high-throughput real-time data pipelines for HFT with Python & SQL. Managed a 10M RMB simulation system validating ML-based trend-tracking strategies under volatile market conditions.
用Python & SQL构建高吞吐实时数据管道用于高频交易;管理千万规模仿真系统,验证ML趋势跟踪策略。
Automated preprocessing pipelines for 500+ companies with Pandas; scraped and structured financial logs from Wind/Flush databases.
用Pandas为500余家公司自动化数据预处理;从Wind/Flush抓取并结构化金融日志。
Weekly office hours mentoring undergraduates in ML, statistics, and Python/R.
每周答疑,辅导本科生机器学习、统计学与Python/R。
MadNote transforms research discovery through a fully backend-driven AI pipeline. Built a FastAPI service unifying abstracts, categories, keywords, and interaction metadata. Engineered a multi-round "Discover For You" bubble onboarding flow — users progressively select topics, seed keywords, and expanded keywords — feeding into a personalized weighted ranking engine. Integrated Mistral-7B + RAG for the "Ask Paper" chat box and data analsis+conclusion, enabling per-paper Q&A with robust fallback handling. Designed a BERT-powered keyword extraction layer driving a knowledge graph similarity navigator. Deployed on Vercel (frontend) + Render (backend) with cross-origin auth and cookie management.
MadNote通过完全后端驱动的AI流程重塑论文发现体验。构建FastAPI服务,统一加载摘要、分类、关键词及交互元数据。设计多轮"Discover For You"气泡引导——用户逐步选择话题、种子关键词与扩展关键词,驱动个性化加权排序。集成Mistral-7B + RAG实现"Ask Paper"论文问答和整体的数据分析总结,具备完善降级处理。BERT关键词提取层驱动知识图谱相似度导航。前端Vercel + 后端Render部署,处理跨域认证与Cookie管理。
Established an ML framework assessing how ESG scores influence bond issuance. Benchmarked XGBoost, LightGBM, and KNN on high-dimensional financial features; evaluated with cross-validation, RMSE, MAE, and R².
建立ML框架评估ESG评分对债券发行的影响。对高维金融特征基准测试XGBoost、LightGBM和KNN;通过交叉验证、RMSE、MAE和R²评估。