Hao (Edison) Wu

吴昊

UNDERGRADUATE RESEARCHER  ·  UW–MADISON  ·  BIOMEDICAL MULTIMODAL AI

本科科研  ·  威斯康星大学麦迪逊分校  ·  生物医学多模态AI

I study reliable biomedical multimodal AI, with a focus on evidence-grounded generation, uncertainty-aware evaluation, and latent reasoning. My current work includes MARIE, a 3D CT report-generation framework under review at ACM Multimedia 2026, and a first-author ACL ARR / EMNLP 2026 submission on uncertainty quantification for LLM-based systems. I am also a Summer Research Assistant at Tsinghua University, working on latent reasoning for diffusion language models.

我研究可靠的生物医学多模态AI,关注证据约束生成、不确定性评估与latent reasoning。当前工作包括MARIE:一项面向3D CT报告生成的证据token定位框架,正在 ACM Multimedia 2026 审稿;以及一篇第一作者 ACL ARR / EMNLP 2026 投稿,研究LLM系统的不确定性量化。我目前也在清华大学担任暑期科研助理,研究latent reasoning如何提升diffusion language model的性能。

Hao Wu

Research

研究方向

01 · MEDICAL MULTIMODAL AI
01 · 医学多模态AI
Evidence-Grounded 3D CT Report Generation
证据约束的3D CT报告生成

MARIE routes sentence-level 3D CT evidence tokens, exposes only the committed subset through an Evidence Dictionary, and audits laterality/anatomy/depth consistency. → ACM Multimedia 2026 under review; rebuttal completed

MARIE 对3D CT进行句子级证据token路由,通过 Evidence Dictionary 约束生成,并审计左右侧、解剖区域与深度一致性。→ ACM Multimedia 2026 审稿中;rebuttal已结束

02 · RELIABILITY & UQ
02 · 可靠性与不确定性
Source-Aware Evaluation for LLM Systems
面向LLM系统的来源感知评估

First-author ACL ARR / EMNLP 2026 submission benchmarking uncertainty quantification over 45,163 fixed LLM ranking records, with Error@10 detection and paired source-response diagnosis.

第一作者 ACL ARR / EMNLP 2026 投稿:在45,163条固定LLM排序记录上评估UQ信号,包含 Error@10 检测与用户/物品来源诊断。

03 · LATENT REASONING
03 · Latent Reasoning
Diffusion Language Models with Adaptive Latent Compute
自适应latent compute的扩散语言模型

At Tsinghua University, I am developing a proposal-stage framework that trains masked latent token slots as reasoning states and allocates latent compute according to uncertainty.

在清华大学暑研中,我正在推进一个proposal-stage框架:将masked latent token训练为推理状态,并基于不确定性分配latent compute。

04 · AI AGENTS & SPATIAL GROUNDING
04 · AI Agent与空间定位
From Medical AI Agents to VLA Systems
从医疗AI Agent到VLA系统

My long-term direction is to build models that ground generated actions, reports, and decisions in explicit perceptual evidence, spanning medical AI agents, embodied AI, and autonomous systems.

长期方向是让模型把生成的动作、报告和决策锚定到明确感知证据上,覆盖医疗AI Agent、具身AI与自动系统。

Publications & Manuscripts

论文与手稿

Under Review
审稿中
MARIE: Multimodal Routing with Evidence Token Grounding for 3D CT Report Generation
MARIE:基于多模态路由与证据Token定位的3D CT报告生成
Tianyi Chen*, Hao Wu*, TianMing Sha, Sipeng Chen, Junnan Li, Hang Zhou, Xiao Luo — *Co-first Authors
陈天一*, 吴昊*, 沙天明, 陈思朋, 李俊男, 周航, 罗潇 — *共同第一作者
ACM Multimedia 2026 · Submission #4550 · Rebuttal completed, decision pending Under Review 审稿中
Towards Realistic Evaluation of Uncertainty Quantification for LLM-based Recommender Systems
面向LLM推荐系统不确定性量化的真实评估
Hao Wu, Tianyi Chen, Sipeng Chen, Hang Zhou, Xiao Luo — First Author
吴昊, 陈天一, 陈思朋, 周航, 罗潇 — 第一作者
ACL ARR 2026 May · Preferred venue: EMNLP 2026 · Submission #14271 Under Review 审稿中
GOING: Generative Mode Switching with Confidence Gain for Real-World RAG Noise Mitigation
GOING:基于置信度增益的生成模式切换以缓解真实场景RAG噪声
Hao Wu, Tianyi Chen, Sipeng Chen, Hang Zhou, Xiao Luo — First Author
吴昊, 陈天毅, 陈思鹏, 周航, 罗潇 — 第一作者
ACM Transactions on Intelligent Systems and Technology · 2026 Under Review 审稿中
Ongoing / In Preparation
进行中 / 准备中
Reasoning-Aware Latent Diffusion for Diffusion Language Models
面向扩散语言模型的Reasoning-Aware Latent Diffusion
Hao Wu, [collaborators-First Author] — Tsinghua Summer RA; proposal-stage project
吴昊, [合作者-第一作者] — 清华大学;proposal-stage项目
Latent reasoning · Diffusion language models · 2026 Ongoing 进行中
Reliable Multi-Property Molecule Optimization with Instruction-Tuned LLMs for Drug Discovery
面向药物发现的指令微调LLM多属性分子优化
Hao Wu, [collaborators] — Co-Author
吴昊, [合作者] — 共同作者
Target: AAAI / BIBM· In preparation In Prep 准备中
GAN-Based Stock Price Prediction Using a GRU-CNN Architecture
基于GRU-CNN架构的GAN股价预测
Hao Wu — First Author
吴昊 — 第一作者
Manuscript in preparation · 2023 In Prep 准备中
Published
已发表
Forecast of Bond Issuance Based on ESG Score
基于ESG评分的债券发行预测
Hao Wu et al. — Co-first Author
吴昊— 共同第一作者
Academic Journal of Mathematical Sciences · 2023 · 2,400+ views Published 已发表

Experience

经历

Tsinghua University
清华大学
SUMMER RESEARCH ASSISTANT — LATENT REASONING FOR DIFFUSION LANGUAGE MODELS
暑期科研助理 — 面向扩散语言模型的LATENT REASONING

Developing a proposal-stage framework, Reasoning-Aware Latent Diffusion (RALD), that trains masked latent token slots as reasoning states and adaptively allocates latent compute based on uncertainty. Targeting reasoning benchmarks such as Sudoku, Zebra, Countdown, GSM8K, and MATH500, with possible extensions to medical VQA and multimodal medical reasoning.

正在推进proposal-stage框架 Reasoning-Aware Latent Diffusion (RALD):将masked latent token训练为推理状态,并依据不确定性自适应分配latent compute。计划评估Sudoku、Zebra、Countdown、GSM8K、MATH500,并探索医学VQA与多模态医学推理扩展。

Diffusion LMLatent ReasoningAdaptive ComputePyTorch
May. 2026 – Present
Beijing / Remote
北京 / 远程
UW–Madison, Department of Statistics
威斯康星大学麦迪逊分校统计系
UNDERGRADUATE RESEARCH ASSISTANT  ·  Advisor: Prof. Xiao Luo
本科科研助理  ·  导师:罗潇 教授

Reliable and verifiable AI generation research. Key outputs: MARIE (ACM MM 2026, co-first author, under review; rebuttal completed), a first-author ACL ARR / EMNLP 2026 submission on uncertainty quantification for LLM-based recommendation, and GOING (TIST, first author, under review). Focus on evidence-committed generation, source-aware reliability evaluation, and training-free RAG robustness.

可靠且可验证的AI生成研究。核心成果:MARIE(ACM MM 2026,共同第一作者,审稿中;rebuttal已结束)、一篇第一作者ACL ARR / EMNLP 2026不确定性量化投稿,以及GOING(TIST,第一作者,审稿中)。研究重点包括证据承诺生成、来源感知可靠性评估与免训练RAG鲁棒性。

3D CTMedical VLMRAGUQLlama-3SwinUNETR
Sep. 2025 – Present
Madison, WI
麦迪逊
Guo Dao Asset Management Co., Ltd.
国道资产管理有限公司
DATA ENGINEERING INTERN — QUANTITATIVE TRADING
数据工程实习生(量化交易)

Built high-throughput real-time data pipelines for HFT with Python & SQL. Managed a 10M RMB simulation system validating ML-based trend-tracking strategies under volatile market conditions.

用Python & SQL构建高吞吐实时数据管道用于高频交易;管理千万规模仿真系统,验证ML趋势跟踪策略。

PythonSQLHFTBacktestingQMT
Jun. – Aug. 2025
Shanghai
上海
Dong Hai Securities Co., Ltd.
东海证券有限责任公司
DATA ANALYST INTERN
数据分析实习生

Automated preprocessing pipelines for 500+ companies with Pandas; scraped and structured financial logs from Wind/Flush databases.

用Pandas为500余家公司自动化数据预处理;从Wind/Flush抓取并结构化金融日志。

PythonPandasWind/Flush
Jul. – Aug. 2023
Changzhou
常州
UW–Madison, Department of Statistics
威斯康星大学麦迪逊分校统计系
DATA SCIENCE PEER MENTOR
数据科学朋辈导师

Weekly office hours mentoring undergraduates in ML, statistics, and Python/R.

每周答疑,辅导本科生机器学习、统计学与Python/R。

Jul. 2025 – Present
Madison, WI
麦迪逊

Projects

项目

MadNote — AI-Powered Academic Discovery Platform
MadNote — AI驱动的学术论文发现平台
MadData 2026 Hackathon
BACKEND & AI DEVELOPER · dotData @ UW–Madison · Feb. 2026
后端与AI开发 · dotData @ UW–Madison · 2026年2月

MadNote transforms research discovery through a fully backend-driven AI pipeline. Built a FastAPI service unifying abstracts, categories, keywords, and interaction metadata. Engineered a multi-round "Discover For You" bubble onboarding flow — users progressively select topics, seed keywords, and expanded keywords — feeding into a personalized weighted ranking engine. Integrated Mistral-7B + RAG for the "Ask Paper" chat box and data analsis+conclusion, enabling per-paper Q&A with robust fallback handling. Designed a BERT-powered keyword extraction layer driving a knowledge graph similarity navigator. Deployed on Vercel (frontend) + Render (backend) with cross-origin auth and cookie management.

MadNote通过完全后端驱动的AI流程重塑论文发现体验。构建FastAPI服务,统一加载摘要、分类、关键词及交互元数据。设计多轮"Discover For You"气泡引导——用户逐步选择话题、种子关键词与扩展关键词,驱动个性化加权排序。集成Mistral-7B + RAG实现"Ask Paper"论文问答和整体的数据分析总结,具备完善降级处理。BERT关键词提取层驱动知识图谱相似度导航。前端Vercel + 后端Render部署,处理跨域认证与Cookie管理。

Ranking: score = 0.45 × topic_match + 0.35 × keyword_overlap + 0.15 × freshness + 0.05 × popularity
排序公式:score = 0.45 × 话题匹配 + 0.35 × 关键词重叠 + 0.15 × 新鲜度 + 0.05 × 热度
FastAPIMistral-7BRAGBERTReact + ViteVercelPython
ESG Bond Issuance Forecasting
ESG债券发行预测研究
Published · 2,400+ views
RESEARCHER & CO-FIRST AUTHOR · Mar. – May 2023
研究员 & 共同第一作者 · 2023年3月–5月

Established an ML framework assessing how ESG scores influence bond issuance. Benchmarked XGBoost, LightGBM, and KNN on high-dimensional financial features; evaluated with cross-validation, RMSE, MAE, and R².

建立ML框架评估ESG评分对债券发行的影响。对高维金融特征基准测试XGBoost、LightGBM和KNN;通过交叉验证、RMSE、MAE和R²评估。

XGBoostLightGBMKNNCross-validationESG

Education

教育背景

University of Wisconsin–Madison
威斯康星大学麦迪逊分校
B.S. Data Science & Information Science (Double Major)
数据科学与信息科学双专业理学学士
GPA 4.0/4.0 · L&S Dean's ListGPA 4.0/4.0 · 院长名单
Sep.2024 – Dec. 2026
Madison, WI
麦迪逊
Xi'an Jiao Tong–Liverpool University (XJTLU)
西交利物浦大学
Year 1–2, Financial Math · Transferred to UW–Madison 2024
大一至大二,金融数学 · 2024年转至UW–Madison
University Excellence Award校级优秀奖
2022 – 2024
Suzhou, China
苏州

Technical Skills

技术技能

Programming编程语言Python · R · MySQL · MongoDB · MATLAB
ML / Data机器学习PyTorch · Scikit-learn · Pandas · NumPy · XGBoost · LightGBM · GANs · Spark · HDFS · R/Shiny
Biomedical AI医学AI3D CT Report Generation · Medical VLMs · Evidence Grounding · SwinUNETR-style 3D Encoders · SimpleITK / NIfTI · Medical VQA Planning
LLM Engineering大模型工程Hugging Face · RAG · FastAPI · Prompt Engineering · Multi-agent · Llama-3 · Mistral · BERT · Diffusion LM · Latent Reasoning
Finance Tools金融工具Bloomberg · Wind-Stock-DB · QMT · Financial Modeling · Stock APIs
Languages语言 English (Fluent) · Mandarin (Native) · Spanish (Elementary) 英语(流利)· 普通话(母语)· 西班牙语(入门)