UNDERGRADUATE RESEARCHER · UW–MADISON · GPA 4.0/4.0
本科科研 · 威斯康星大学麦迪逊分校 · GPA 4.0/4.0
I study reliable and verifiable AI generation,, with a focus on when models should be held accountable to the evidence they cite and how that accountability can be tested. My current work spans RAG robustness and grounded 3D medical report generation, with one paper under review at ACM Multimedia 2026 and another being prepared for submission to ACM Transactions on Intelligent Systems and Technology. In the longer term, I am interested in extending evidence-grounded and spatial grounding methods to medical AI, embodied AI, and autonomous driving.
我研究可靠且可验证的AI生成, 关注模型何时应对其引用的证据负责,以及我们如何验证这种责任。当前工作涵盖RAG鲁棒性与有依据的3D医学报告生成,目前一篇论文在 ACM MM 2026 审稿中,另一篇正投稿至 ACM TIST。长期来看,我希望将这类基于证据约束与空间定位的生成方法进一步延伸到医疗、具身AI与自动驾驶场景。
Token-level confidence gain as a training-free signal for detecting toxic retrieval and switching generation modes. → GOING, ACM TIST (in submission)
以 token 级置信度增益作为免训练信号,检测有害检索并切换生成模式。→ GOING, ACM TIST(投稿中)
Every generated sentence must cite the exact spatial tokens it draws from, enforcing evidence commitment by construction. → MARIE, ACM MM '26. Extending the same reliability lens to multi-property molecule optimization with instruction-tuned LLMs for lead optimization in drug discovery — score-free candidate selection when property predictors are noisy. → NeurIPS / AAAI '26 (in preparation). The token-gating idea also parallels grounding VLA planning outputs in BEV tokens for autonomous driving and robotic manipulation.
每条生成句子须引用其所依赖的空间 token,从设计层面保障证据承诺。→ MARIE, ACM MM '26。同一可靠性视角也延伸至药物发现中基于指令微调 LLM 的多属性分子优化——在属性预测噪声下进行无评分候选选择。→ NeurIPS / AAAI '26(准备中)
GAN/GRU-CNN architectures for stock prediction; ESG-driven bond issuance forecasting with XGBoost and LightGBM.
GAN/GRU-CNN 架构用于股价预测;基于 XGBoost 与 LightGBM 的 ESG 债券发行预测。
A future direction I am actively pursuing: applying spatial token grounding techniques to medical AI, embodied AI, and autonomous driving, where VLA models face the same core challenge of anchoring planning outputs to explicit perceptual evidence. The MARIE evidence commitment mechanism provides a direct conceptual bridge.
我正在积极探索的未来方向:将空间 token 定位技术延伸至医疗、具身 AI 与自动驾驶领域。VLA 模型面临与 MARIE 相同的核心挑战,即如何将规划输出锚定于明确的感知证据;MARIE 的证据承诺机制为此提供了直接的概念桥梁。
RAG robustness and hallucination mitigation research. Key outputs: MARIE (ACM MM '26, co-first author, under review) and GOING (ACM TIST Research Note, first author, in submission). Focus on training-free confidence-based filtering and token-level evidence commitment for grounded generation. Currently leading a first-author manuscript on reliable multi-property molecule optimization with instruction-tuned LLMs for drug discovery (NeurIPS / AAAI '26 in preparation).
RAG鲁棒性与幻觉缓解研究。核心成果:MARIE(ACM MM '26共同第一作者,审稿中)与GOING(ACM TIST Research Note第一作者,投稿中)。核心:免训练置信度过滤与token级证据承诺。当前正主导一项第一作者工作——面向药物发现的指令微调 LLM 多属性分子优化(NeurIPS / AAAI '26 准备中)。
Built high-throughput real-time data pipelines for HFT with Python & SQL. Managed a 10M RMB simulation system validating ML-based trend-tracking strategies under volatile market conditions.
用Python & SQL构建高吞吐实时数据管道用于高频交易;管理千万规模仿真系统,验证ML趋势跟踪策略。
Automated preprocessing pipelines for 500+ companies with Pandas; scraped and structured financial logs from Wind/Flush databases.
用Pandas为500余家公司自动化数据预处理;从Wind/Flush抓取并结构化金融日志。
Weekly office hours mentoring undergraduates in ML, statistics, and Python/R.
每周答疑,辅导本科生机器学习、统计学与Python/R。
MadNote transforms research discovery through a fully backend-driven AI pipeline. Built a FastAPI service unifying abstracts, categories, keywords, and interaction metadata. Engineered a multi-round "Discover For You" bubble onboarding flow — users progressively select topics, seed keywords, and expanded keywords — feeding into a personalized weighted ranking engine. Integrated Mistral-7B + RAG for the "Ask Paper" chat box and data analsis+conclusion, enabling per-paper Q&A with robust fallback handling. Designed a BERT-powered keyword extraction layer driving a knowledge graph similarity navigator. Deployed on Vercel (frontend) + Render (backend) with cross-origin auth and cookie management.
MadNote通过完全后端驱动的AI流程重塑论文发现体验。构建FastAPI服务,统一加载摘要、分类、关键词及交互元数据。设计多轮"Discover For You"气泡引导——用户逐步选择话题、种子关键词与扩展关键词,驱动个性化加权排序。集成Mistral-7B + RAG实现"Ask Paper"论文问答和整体的数据分析总结,具备完善降级处理。BERT关键词提取层驱动知识图谱相似度导航。前端Vercel + 后端Render部署,处理跨域认证与Cookie管理。
Established an ML framework assessing how ESG scores influence bond issuance. Benchmarked XGBoost, LightGBM, and KNN on high-dimensional financial features; evaluated with cross-validation, RMSE, MAE, and R².
建立ML框架评估ESG评分对债券发行的影响。对高维金融特征基准测试XGBoost、LightGBM和KNN;通过交叉验证、RMSE、MAE和R²评估。