About Me

Wei-Hao Chen

Hello, my name is Wei-Hao Chen. I am a third-year Ph.D. student in the Department of Computer Science at Purdue University and a member of the Human-Centered Software Systems Lab under the guidance of Professor Tianyi Zhang. My research primarily focuses on Human-AI Interaction (HAI), Software Engineering, and Artificial Intelligence. I am currently developing tools to help data scientists using several techniques such as Programming-by-Demonstration, LLM, and Artificial Intelligence. Prior to my Ph.D. studies at Purdue, I worked as a software engineer at Realtek Semiconductor Corp in Taiwan. My previous research was centered around Cryptography. In my free time, I enjoy traveling, playing video games 🎮, and I love cats 😺.

Email CV

Research Projects

Web Automation for Data Collection

Web automation is frequently used by data scientists, domain experts, and programmers to complete time-consuming data collection tasks. However, developing web automation scripts requires familiarity with a programming language and HTML, which remains a key learning barrier for non-expert users. We provide MIWA, a mixed-initiative web automation system that enables users to create web automation scripts by demonstrating what content they want from the targeted websites. Compared to existing web automation tools, MIWA helps users better understand a generated script and build trust in it by (1) providing a step-by-step explanation of the script's behavior with visual correspondence to the target website, (2) supporting greater autonomy and control over web automation via step-through debugging and fine-grained demonstration refinement, and (3) automatically detecting potential corner cases that are handled improperly by the generated script. We conducted a within-subjects user study with 24 participants and compared MIWA with Rousillon, a state-of-the-art web automation tool. Results showed that, compared to Rousillon, MIWA reduced the task completion time by half while helping participants gain more confidence in the generated script.

Keywords: Programming by Demonstration, Web Automation, Data Science

Understanding the Coding Behavior of Data Scientists at the Action Level

In the era of big data, Data Science (DS) plays a crucial role in gaining valuable insights from data across various domains. However, our understanding of DS programmers' coding behavior remains insufficient. Previous studies mainly analyzed DS code from public code-sharing platforms such as GitHub and Kaggle, which are limited to code changes committed to the version history, omitting many coding errors that are resolved before code commits. To bridge this gap, we present a comprehensive analysis of the fine-grained logs of a DS hackathon that includes Jupyter Notebooks written by participants for over six weeks. We recorded all code changes and program execution logs, enabling us to identify common programming mistakes and bugs across different data science stages and programmers' debugging behavior. This work enhances our understanding of DS coding errors and debugging practices of DS programmers, highlighting several future opportunities for designing new tool support for DS programming.

Keywords: Data Science, Jupyter Notebook, Programming Behavior

Publications

MIWA: Mixed-Initiative Web Automation for Better User Control and Confidence
Wei-Hao Chen, Xiaoyu Liu, Jiacheng Zhang, Ian Iong Lam, Zhicheng Huang, Rui Dong, Xinyu Wang, Tianyi Zhang
UIST 23: Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology
Efficient Key-Aggregate Proxy Re-Encryption for Secure Data Sharing in Clouds
Wei-Hao Chen, Chun-I Fan, Yi-Fan Tseng
2018 IEEE Conference on Dependable and Secure Computing (DSC)

News

03/2024
I have been selected as the recipient of a Summer Research Grant. Thanks to the Purdue Computer Science Department! 🎉
09/2023
I am Graduate TA for CS 240: Programming in C at Purdue University.
07/2023
Our paper - MIWA: Mixed-Initiative Web Automation for Better User Control and Confidence is accepted by UIST 2023 🎉.
05/2022
I start my internship at Intel Corporation.
01/2022
I am Graduate TA for CS 180: Problem Solving And Object-Oriented Programming.
09/2021
I am Graduate TA for CS 240: Programming in C at Purdue University.
09/2021
I join Purdue University 🚂 as an PhD student in Computer Science.