Hao (Edison) Wu

吴昊

UNDERGRADUATE RESEARCHER · UW–MADISON · BIOMEDICAL MULTIMODAL AI

本科科研 · 威斯康星大学麦迪逊分校 · 生物医学多模态AI

I study reliable biomedical multimodal AI, with a focus on evidence-grounded generation, uncertainty-aware evaluation, and latent reasoning. My current work includes MARIE, a 3D CT report-generation framework under review at ACM Multimedia 2026, and a first-author ACL ARR / EMNLP 2026 submission on uncertainty quantification for LLM-based systems. I am also a Summer Research Assistant at Tsinghua University, working on latent reasoning for diffusion language models.

我研究可靠的生物医学多模态AI，关注证据约束生成、不确定性评估与latent reasoning。当前工作包括MARIE：一项面向3D CT报告生成的证据token定位框架，正在 ACM Multimedia 2026 审稿；以及一篇第一作者 ACL ARR / EMNLP 2026 投稿，研究LLM系统的不确定性量化。我目前也在清华大学担任暑期科研助理，研究latent reasoning如何提升diffusion language model的性能。

hwu522@wisc.edu GitHub LinkedIn CV

Research

研究方向

01 · MEDICAL MULTIMODAL AI

01 · 医学多模态AI

Evidence-Grounded 3D CT Report Generation

证据约束的3D CT报告生成

MARIE routes sentence-level 3D CT evidence tokens, exposes only the committed subset through an Evidence Dictionary, and audits laterality/anatomy/depth consistency. → ACM Multimedia 2026 under review; rebuttal completed

MARIE 对3D CT进行句子级证据token路由，通过 Evidence Dictionary 约束生成，并审计左右侧、解剖区域与深度一致性。→ ACM Multimedia 2026 审稿中；rebuttal已结束

02 · RELIABILITY & UQ

02 · 可靠性与不确定性

Source-Aware Evaluation for LLM Systems

面向LLM系统的来源感知评估

First-author ACL ARR / EMNLP 2026 submission benchmarking uncertainty quantification over 45,163 fixed LLM ranking records, with Error@10 detection and paired source-response diagnosis.

第一作者 ACL ARR / EMNLP 2026 投稿：在45,163条固定LLM排序记录上评估UQ信号，包含 Error@10 检测与用户/物品来源诊断。

03 · LATENT REASONING

03 · Latent Reasoning

Diffusion Language Models with Adaptive Latent Compute

自适应latent compute的扩散语言模型

At Tsinghua University, I am developing a proposal-stage framework that trains masked latent token slots as reasoning states and allocates latent compute according to uncertainty.

在清华大学暑研中，我正在推进一个proposal-stage框架：将masked latent token训练为推理状态，并基于不确定性分配latent compute。

04 · AI AGENTS & SPATIAL GROUNDING

04 · AI Agent与空间定位

From Medical AI Agents to VLA Systems

从医疗AI Agent到VLA系统

My long-term direction is to build models that ground generated actions, reports, and decisions in explicit perceptual evidence, spanning medical AI agents, embodied AI, and autonomous systems.

长期方向是让模型把生成的动作、报告和决策锚定到明确感知证据上，覆盖医疗AI Agent、具身AI与自动系统。

Publications & Manuscripts

论文与手稿

Under Review

审稿中

MARIE: Multimodal Routing with Evidence Token Grounding for 3D CT Report Generation

MARIE：基于多模态路由与证据Token定位的3D CT报告生成

Tianyi Chen*, Hao Wu*, TianMing Sha, Sipeng Chen, Junnan Li, Hang Zhou, Xiao Luo — *Co-first Authors

陈天一*, 吴昊*, 沙天明, 陈思朋, 李俊男, 周航, 罗潇 — *共同第一作者

ACM Multimedia 2026 · Submission #4550 · Rebuttal completed, decision pending Under Review 审稿中

Towards Realistic Evaluation of Uncertainty Quantification for LLM-based Recommender Systems

面向LLM推荐系统不确定性量化的真实评估

Hao Wu, Tianyi Chen, Sipeng Chen, Hang Zhou, Xiao Luo — First Author

吴昊, 陈天一, 陈思朋, 周航, 罗潇 — 第一作者

ACL ARR 2026 May · Preferred venue: EMNLP 2026 · Submission #14271 Under Review 审稿中

GOING: Generative Mode Switching with Confidence Gain for Real-World RAG Noise Mitigation

GOING：基于置信度增益的生成模式切换以缓解真实场景RAG噪声

Hao Wu, Tianyi Chen, Sipeng Chen, Hang Zhou, Xiao Luo — First Author

吴昊, 陈天毅, 陈思鹏, 周航, 罗潇 — 第一作者

ACM Transactions on Intelligent Systems and Technology · 2026 Under Review 审稿中

Ongoing / In Preparation

进行中 / 准备中

Reasoning-Aware Latent Diffusion for Diffusion Language Models

面向扩散语言模型的Reasoning-Aware Latent Diffusion

Hao Wu, [collaborators-First Author] — Tsinghua Summer RA; proposal-stage project

吴昊, [合作者-第一作者] — 清华大学；proposal-stage项目

Latent reasoning · Diffusion language models · 2026 Ongoing 进行中

Reliable Multi-Property Molecule Optimization with Instruction-Tuned LLMs for Drug Discovery

面向药物发现的指令微调LLM多属性分子优化

Hao Wu, [collaborators] — Co-Author

吴昊, [合作者] — 共同作者

Target: AAAI / BIBM· In preparation In Prep 准备中

GAN-Based Stock Price Prediction Using a GRU-CNN Architecture

基于GRU-CNN架构的GAN股价预测

Hao Wu — First Author

吴昊 — 第一作者

Manuscript in preparation · 2023 In Prep 准备中

Published

已发表

Forecast of Bond Issuance Based on ESG Score

基于ESG评分的债券发行预测

Hao Wu et al. — Co-first Author

吴昊等 — 共同第一作者

Academic Journal of Mathematical Sciences · 2023 · 2,400+ views Published 已发表

Experience

经历

Tsinghua University

清华大学

SUMMER RESEARCH ASSISTANT — LATENT REASONING FOR DIFFUSION LANGUAGE MODELS

暑期科研助理 — 面向扩散语言模型的LATENT REASONING

Developing a proposal-stage framework, Reasoning-Aware Latent Diffusion (RALD), that trains masked latent token slots as reasoning states and adaptively allocates latent compute based on uncertainty. Targeting reasoning benchmarks such as Sudoku, Zebra, Countdown, GSM8K, and MATH500, with possible extensions to medical VQA and multimodal medical reasoning.

正在推进proposal-stage框架 Reasoning-Aware Latent Diffusion (RALD)：将masked latent token训练为推理状态，并依据不确定性自适应分配latent compute。计划评估Sudoku、Zebra、Countdown、GSM8K、MATH500，并探索医学VQA与多模态医学推理扩展。

Diffusion LMLatent ReasoningAdaptive ComputePyTorch

May. 2026 – Present

Beijing / Remote

北京 / 远程

UW–Madison, Department of Statistics

威斯康星大学麦迪逊分校统计系

UNDERGRADUATE RESEARCH ASSISTANT · Advisor: Prof. Xiao Luo

本科科研助理 · 导师：罗潇教授

Reliable and verifiable AI generation research. Key outputs: MARIE (ACM MM 2026, co-first author, under review; rebuttal completed), a first-author ACL ARR / EMNLP 2026 submission on uncertainty quantification for LLM-based recommendation, and GOING (TIST, first author, under review). Focus on evidence-committed generation, source-aware reliability evaluation, and training-free RAG robustness.

可靠且可验证的AI生成研究。核心成果：MARIE（ACM MM 2026，共同第一作者，审稿中；rebuttal已结束）、一篇第一作者ACL ARR / EMNLP 2026不确定性量化投稿，以及GOING（TIST，第一作者，审稿中）。研究重点包括证据承诺生成、来源感知可靠性评估与免训练RAG鲁棒性。

3D CTMedical VLMRAGUQLlama-3SwinUNETR

Sep. 2025 – Present

Madison, WI

麦迪逊

Guo Dao Asset Management Co., Ltd.

国道资产管理有限公司

DATA ENGINEERING INTERN — QUANTITATIVE TRADING

数据工程实习生（量化交易）

Built high-throughput real-time data pipelines for HFT with Python & SQL. Managed a 10M RMB simulation system validating ML-based trend-tracking strategies under volatile market conditions.

用Python & SQL构建高吞吐实时数据管道用于高频交易；管理千万规模仿真系统，验证ML趋势跟踪策略。

PythonSQLHFTBacktestingQMT

Jun. – Aug. 2025

Shanghai

上海

Dong Hai Securities Co., Ltd.

东海证券有限责任公司

DATA ANALYST INTERN

数据分析实习生

Automated preprocessing pipelines for 500+ companies with Pandas; scraped and structured financial logs from Wind/Flush databases.

用Pandas为500余家公司自动化数据预处理；从Wind/Flush抓取并结构化金融日志。

PythonPandasWind/Flush

Jul. – Aug. 2023

Changzhou

常州

UW–Madison, Department of Statistics

威斯康星大学麦迪逊分校统计系

DATA SCIENCE PEER MENTOR

数据科学朋辈导师

Weekly office hours mentoring undergraduates in ML, statistics, and Python/R.

每周答疑，辅导本科生机器学习、统计学与Python/R。

Jul. 2025 – Present

Madison, WI

麦迪逊

Projects

项目

MadNote — AI-Powered Academic Discovery Platform

MadNote — AI驱动的学术论文发现平台

MadData 2026 Hackathon

BACKEND & AI DEVELOPER · dotData @ UW–Madison · Feb. 2026

后端与AI开发 · dotData @ UW–Madison · 2026年2月

MadNote transforms research discovery through a fully backend-driven AI pipeline. Built a FastAPI service unifying abstracts, categories, keywords, and interaction metadata. Engineered a multi-round "Discover For You" bubble onboarding flow — users progressively select topics, seed keywords, and expanded keywords — feeding into a personalized weighted ranking engine. Integrated Mistral-7B + RAG for the "Ask Paper" chat box and data analsis+conclusion, enabling per-paper Q&A with robust fallback handling. Designed a BERT-powered keyword extraction layer driving a knowledge graph similarity navigator. Deployed on Vercel (frontend) + Render (backend) with cross-origin auth and cookie management.

MadNote通过完全后端驱动的AI流程重塑论文发现体验。构建FastAPI服务，统一加载摘要、分类、关键词及交互元数据。设计多轮"Discover For You"气泡引导——用户逐步选择话题、种子关键词与扩展关键词，驱动个性化加权排序。集成Mistral-7B + RAG实现"Ask Paper"论文问答和整体的数据分析总结，具备完善降级处理。BERT关键词提取层驱动知识图谱相似度导航。前端Vercel + 后端Render部署，处理跨域认证与Cookie管理。

Ranking: score = 0.45 × topic_match + 0.35 × keyword_overlap + 0.15 × freshness + 0.05 × popularity

排序公式：score = 0.45 × 话题匹配 + 0.35 × 关键词重叠 + 0.15 × 新鲜度 + 0.05 × 热度

FastAPIMistral-7BRAGBERTReact + ViteVercelPython

↗ Live Demo ⌥ GitHub ⬡ Devpost

ESG Bond Issuance Forecasting

ESG债券发行预测研究

Published · 2,400+ views

RESEARCHER & CO-FIRST AUTHOR · Mar. – May 2023

研究员 & 共同第一作者 · 2023年3月–5月

Established an ML framework assessing how ESG scores influence bond issuance. Benchmarked XGBoost, LightGBM, and KNN on high-dimensional financial features; evaluated with cross-validation, RMSE, MAE, and R².

建立ML框架评估ESG评分对债券发行的影响。对高维金融特征基准测试XGBoost、LightGBM和KNN；通过交叉验证、RMSE、MAE和R²评估。

XGBoostLightGBMKNNCross-validationESG

Education

教育背景

University of Wisconsin–Madison

威斯康星大学麦迪逊分校

B.S. Data Science & Information Science (Double Major)

数据科学与信息科学双专业理学学士

GPA 4.0/4.0 · L&S Dean's ListGPA 4.0/4.0 · 院长名单

Sep.2024 – Dec. 2026

Madison, WI

麦迪逊

Xi'an Jiao Tong–Liverpool University (XJTLU)

西交利物浦大学

Year 1–2, Financial Math · Transferred to UW–Madison 2024

大一至大二，金融数学 · 2024年转至UW–Madison

University Excellence Award校级优秀奖

2022 – 2024

Suzhou, China

苏州

Technical Skills

技术技能

Programming编程语言Python · R · MySQL · MongoDB · MATLAB

ML / Data机器学习PyTorch · Scikit-learn · Pandas · NumPy · XGBoost · LightGBM · GANs · Spark · HDFS · R/Shiny

Biomedical AI医学AI3D CT Report Generation · Medical VLMs · Evidence Grounding · SwinUNETR-style 3D Encoders · SimpleITK / NIfTI · Medical VQA Planning

LLM Engineering大模型工程Hugging Face · RAG · FastAPI · Prompt Engineering · Multi-agent · Llama-3 · Mistral · BERT · Diffusion LM · Latent Reasoning

Finance Tools金融工具Bloomberg · Wind-Stock-DB · QMT · Financial Modeling · Stock APIs

Languages语言 English (Fluent) · Mandarin (Native) · Spanish (Elementary) 英语（流利）· 普通话（母语）· 西班牙语（入门）