Hao (Edison) Wu

吴昊

UNDERGRADUATE RESEARCHER  ·  UW–MADISON  ·  GPA 4.0/4.0

本科科研  ·  威斯康星大学麦迪逊分校  ·  GPA 4.0/4.0

I study reliable and verifiable AI generation,, with a focus on when models should be held accountable to the evidence they cite and how that accountability can be tested. My current work spans RAG robustness and grounded 3D medical report generation, with one paper under review at ACM Multimedia 2026 and another being prepared for submission to ACM Transactions on Intelligent Systems and Technology. In the longer term, I am interested in extending evidence-grounded and spatial grounding methods to medical AI, embodied AI, and autonomous driving.

我研究可靠且可验证的AI生成, 关注模型何时应对其引用的证据负责,以及我们如何验证这种责任。当前工作涵盖RAG鲁棒性与有依据的3D医学报告生成,目前一篇论文在 ACM MM 2026 审稿中,另一篇正投稿至 ACM TIST。长期来看,我希望将这类基于证据约束与空间定位的生成方法进一步延伸到医疗、具身AI与自动驾驶场景。

Hao Wu

Research

研究方向

01 · RAG ROBUSTNESS
01 · RAG鲁棒性
Confidence-Based Circuit Breakers
基于置信度的电路断路器

Token-level confidence gain as a training-free signal for detecting toxic retrieval and switching generation modes. → GOING, ACM TIST (in submission)

以 token 级置信度增益作为免训练信号,检测有害检索并切换生成模式。→ GOING, ACM TIST(投稿中)

02 · AI FOR MEDICAL & DRUG DISCOVERY
02 · 医疗AI与药物发现
Verifiable 3D Reporting & Reliable Molecule Optimization
可验证 3D 医学报告生成与可靠分子优化

Every generated sentence must cite the exact spatial tokens it draws from, enforcing evidence commitment by construction. → MARIE, ACM MM '26. Extending the same reliability lens to multi-property molecule optimization with instruction-tuned LLMs for lead optimization in drug discovery — score-free candidate selection when property predictors are noisy. → NeurIPS / AAAI '26 (in preparation). The token-gating idea also parallels grounding VLA planning outputs in BEV tokens for autonomous driving and robotic manipulation.

每条生成句子须引用其所依赖的空间 token,从设计层面保障证据承诺。→ MARIE, ACM MM '26。同一可靠性视角也延伸至药物发现中基于指令微调 LLM 的多属性分子优化——在属性预测噪声下进行无评分候选选择。→ NeurIPS / AAAI '26(准备中)

03 · QUANTITATIVE FINANCE
03 · 量化金融
Deep Learning for Markets
深度学习与金融市场

GAN/GRU-CNN architectures for stock prediction; ESG-driven bond issuance forecasting with XGBoost and LightGBM.

GAN/GRU-CNN 架构用于股价预测;基于 XGBoost 与 LightGBM 的 ESG 债券发行预测。

04 · EMBODIED AI & AUTONOMOUS SYSTEMS
04 · 具身AI与自动驾驶
Spatial Grounding → VLA & Embodied AI
空间定位 → VLA与具身智能

A future direction I am actively pursuing: applying spatial token grounding techniques to medical AI, embodied AI, and autonomous driving, where VLA models face the same core challenge of anchoring planning outputs to explicit perceptual evidence. The MARIE evidence commitment mechanism provides a direct conceptual bridge.

我正在积极探索的未来方向:将空间 token 定位技术延伸至医疗、具身 AI 与自动驾驶领域。VLA 模型面临与 MARIE 相同的核心挑战,即如何将规划输出锚定于明确的感知证据;MARIE 的证据承诺机制为此提供了直接的概念桥梁。

Publications & Manuscripts

论文与手稿

Under Review
审稿中
GOING: Generative Mode Switching with Confidence Gain for Real-World RAG Noise Mitigation
GOING: 基于置信度增益的生成模式切换以缓解真实场景 RAG 噪声
Hao Wu, Tianyi Chen, Sipeng Chen, Hang Zhou, Xiao Luo
吴昊, 陈天毅, 陈思鹏, 周航, 罗潇
ACM Transactions on Intelligent Systems and Technology (TIST) · Research Note · 2026 Under Review 审稿中 Code
MARIE: Multimodal Routing with Evidence Token Grounding for 3D CT Report Generation
MARIE:基于多模态路由与证据Token定位的3D CT报告生成
Tianyi Chen*, Hao Wu*, TianMing Sha, Sipeng Chen, Junnan Li, Hang Zhou, Xiao Luo — *Co-first Authors
陈天一*, 吴昊*, 沙天明, 陈思朋, 李俊男, 周航, 罗潇 — *共同第一作者
ACM Multimedia 2026 · Submission #4550 · Apr. 2026 Under Review 审稿中
Published
已发表
Forecast of Bond Issuance Based on ESG Score
基于ESG评分的债券发行预测
Hao Wu et al. — Co-first Author
吴昊— 共同第一作者
Academic Journal of Mathematical Sciences · 2023 · 2,400+ views Published 已发表
Manuscript/In Preparation
准备中
[Title Withheld] — Benchmark & Evaluation Framework for Robustness and Uncertainty in Language-Grounded AI Systems
[标题暂略] — 面向语言基座 AI 系统的鲁棒性与不确定性评估基准
Hao Wu, [Co-author(s)] — Co-author; benchmark design, evaluation framework & infrastructure
吴昊, [合作者] — 合作者;负责基准设计、评估框架与基础设施
Target: EMNLP 2026 · ARR May 2026 In Prep 准备中
[Title Withheld] — Reliable Multi-Property Molecule Optimization with Instruction-Tuned LLMs for Drug Discovery
[标题暂略] — 面向药物发现的指令微调 LLM 多属性分子优化
Hao Wu, [Co-author(s)] — First Author
吴昊, [合作者] — 第一作者
Target: NeurIPS 2026 / AAAI 2027 · Submission window May–Aug. 2026 In Prep 准备中
GAN-Based Stock Price Prediction Using a GRU-CNN Architecture
基于GRU-CNN架构的GAN股价预测
Hao Wu — First Author
吴昊 — 第一作者
Manuscript in preparation · 2023 In Prep 准备中

Experience

经历

UW–Madison, Department of Statistics
威斯康星大学麦迪逊分校统计系
UNDERGRADUATE RESEARCH ASSISTANT  ·  Advisor: Prof. Xiao Luo
本科科研助理  ·  导师:罗潇 教授

RAG robustness and hallucination mitigation research. Key outputs: MARIE (ACM MM '26, co-first author, under review) and GOING (ACM TIST Research Note, first author, in submission). Focus on training-free confidence-based filtering and token-level evidence commitment for grounded generation. Currently leading a first-author manuscript on reliable multi-property molecule optimization with instruction-tuned LLMs for drug discovery (NeurIPS / AAAI '26 in preparation).

RAG鲁棒性与幻觉缓解研究。核心成果:MARIE(ACM MM '26共同第一作者,审稿中)与GOING(ACM TIST Research Note第一作者,投稿中)。核心:免训练置信度过滤与token级证据承诺。当前正主导一项第一作者工作——面向药物发现的指令微调 LLM 多属性分子优化(NeurIPS / AAAI '26 准备中)。

PyTorchHugging FaceLlama-3-8BRAGColBERTv2
Sep. 2025 – Present
Madison, WI
麦迪逊
Guo Dao Asset Management Co., Ltd.
国道资产管理有限公司
DATA ENGINEERING INTERN — QUANTITATIVE TRADING
数据工程实习生(量化交易)

Built high-throughput real-time data pipelines for HFT with Python & SQL. Managed a 10M RMB simulation system validating ML-based trend-tracking strategies under volatile market conditions.

用Python & SQL构建高吞吐实时数据管道用于高频交易;管理千万规模仿真系统,验证ML趋势跟踪策略。

PythonSQLHFTBacktestingQMT
Jun. – Aug. 2025
Shanghai
上海
Dong Hai Securities Co., Ltd.
东海证券有限责任公司
DATA ANALYST INTERN
数据分析实习生

Automated preprocessing pipelines for 500+ companies with Pandas; scraped and structured financial logs from Wind/Flush databases.

用Pandas为500余家公司自动化数据预处理;从Wind/Flush抓取并结构化金融日志。

PythonPandasWind/Flush
Jul. – Aug. 2023
Changzhou
常州
UW–Madison, Department of Statistics
威斯康星大学麦迪逊分校统计系
DATA SCIENCE PEER MENTOR
数据科学朋辈导师

Weekly office hours mentoring undergraduates in ML, statistics, and Python/R.

每周答疑,辅导本科生机器学习、统计学与Python/R。

Jul. 2025 – Present
Madison, WI
麦迪逊

Projects

项目

MadNote — AI-Powered Academic Discovery Platform
MadNote — AI驱动的学术论文发现平台
MadData 2026 Hackathon
BACKEND & AI DEVELOPER · dotData @ UW–Madison · Feb. 2026
后端与AI开发 · dotData @ UW–Madison · 2026年2月

MadNote transforms research discovery through a fully backend-driven AI pipeline. Built a FastAPI service unifying abstracts, categories, keywords, and interaction metadata. Engineered a multi-round "Discover For You" bubble onboarding flow — users progressively select topics, seed keywords, and expanded keywords — feeding into a personalized weighted ranking engine. Integrated Mistral-7B + RAG for the "Ask Paper" chat box and data analsis+conclusion, enabling per-paper Q&A with robust fallback handling. Designed a BERT-powered keyword extraction layer driving a knowledge graph similarity navigator. Deployed on Vercel (frontend) + Render (backend) with cross-origin auth and cookie management.

MadNote通过完全后端驱动的AI流程重塑论文发现体验。构建FastAPI服务,统一加载摘要、分类、关键词及交互元数据。设计多轮"Discover For You"气泡引导——用户逐步选择话题、种子关键词与扩展关键词,驱动个性化加权排序。集成Mistral-7B + RAG实现"Ask Paper"论文问答和整体的数据分析总结,具备完善降级处理。BERT关键词提取层驱动知识图谱相似度导航。前端Vercel + 后端Render部署,处理跨域认证与Cookie管理。

Ranking: score = 0.45 × topic_match + 0.35 × keyword_overlap + 0.15 × freshness + 0.05 × popularity
排序公式:score = 0.45 × 话题匹配 + 0.35 × 关键词重叠 + 0.15 × 新鲜度 + 0.05 × 热度
FastAPIMistral-7BRAGBERTReact + ViteVercelPython
ESG Bond Issuance Forecasting
ESG债券发行预测研究
Published · 2,400+ views
RESEARCHER & CO-FIRST AUTHOR · Mar. – May 2023
研究员 & 共同第一作者 · 2023年3月–5月

Established an ML framework assessing how ESG scores influence bond issuance. Benchmarked XGBoost, LightGBM, and KNN on high-dimensional financial features; evaluated with cross-validation, RMSE, MAE, and R².

建立ML框架评估ESG评分对债券发行的影响。对高维金融特征基准测试XGBoost、LightGBM和KNN;通过交叉验证、RMSE、MAE和R²评估。

XGBoostLightGBMKNNCross-validationESG

Education

教育背景

University of Wisconsin–Madison
威斯康星大学麦迪逊分校
B.S. Data Science & Information Science (Double Major)
数据科学与信息科学双专业理学学士
GPA 4.0/4.0 · L&S Dean's ListGPA 4.0/4.0 · 院长名单
Sep.2024 – Dec. 2026
Madison, WI
麦迪逊
Xi'an Jiao Tong–Liverpool University (XJTLU)
西交利物浦大学
Year 1–2, Financial Math · Transferred to UW–Madison 2024
大一至大二,金融数学 · 2024年转至UW–Madison
University Excellence Award校级优秀奖
2022 – 2024
Suzhou, China
苏州

Technical Skills

技术技能

Programming编程语言Python · R · MySQL · MongoDB · MATLAB
ML / Data机器学习PyTorch · Scikit-learn · Pandas · NumPy · XGBoost · LightGBM · GANs · Spark · HDFS · R/Shiny
LLM Engineering大模型工程Hugging Face · RAG · FastAPI · Prompt Engineering · Multi-agent · Llama-3 · Mistral · BERT
Finance Tools金融工具Bloomberg · Wind-Stock-DB · QMT · Financial Modeling · Stock APIs
Languages语言 English (Fluent) · Mandarin (Native) · Spanish (Elementary) 英语(流利)· 普通话(母语)· 西班牙语(入门)