Zihan Chen, Ph.D. Candidate in Data Science
Bio
Zihan Chen is a final-year data science Ph.D. student, whose research focuses on fintech, information systems (IS) and graph representation learning. His work has been published in top IS and operations management conferences. Zihan also works as a research intern at leading financial companies (Jefferies, AllianceBernstein) and tech firms (Amazon).
Skillset
Zihan has a solid background in statistics, econometrics, and information systems. He is also proficient in algorithms and data structures (Python, SQL, R), and familiar with Linux and AWS environments. He has been a course instructor for multiple Master's level courses in business analysis, mathematics, and web mining, receiving median course evaluation scores of 5/5.
Dissertation Summary
Three essays on graph-based deep learning techniques in business analytics
The matrix of today’s world is fundamentally graph-based: from the interconnected web of global supply chains to the intricacies of social networks, and from the sprawling expanse of online consumer behavior to the intertwined organizational collaborations. These graph structures encapsulate a wealth of patterns and insights waiting to be decoded. Although deep learning models have become crucial for managing the vast volumes of data that modern commerce produces, the potential of graph-based deep learning techniques remains largely untapped. A significant challenge is the opacity of these models, which hampers informed decision-making, as stakeholders in the business realm grapple with the complexities of models they cannot elucidate. Consequently, there is a pressing need to architect explainable graph-based deep learning models to foster better, data-driven decisions.
To address the identified gaps, this dissertation proposal explores three distinct yet inherently connected avenues of graph-based deep learning techniques in the business domain. In essay 1, we leverage graph structures to model interrelations among e-commerce products. We introduce a novel Graph Neural Network (GNN) that amalgamates both temporal data—historical sales of products—and spatial data—interrelations among products—for future sales prediction. Recognizing that product associations can either be transient or enduring, our model integrates both through the incorporation of static and dynamic attention mechanisms. This dual attention approach not only enables our model to surpass benchmark performance on a real-world dataset but also offers insights into how forecasts are made using aggregated information from similar products.
In essay 2, we develop a theory-driven GNN model wherein information dissemination follows paths delineated by social network theory. We unveil the first-ever GNN that employs user stance as an edge feature. Recognizing that user stances can sculpt unique echo chambers, we delineate four theory-informed information propagation paths for the GNN to assimilate information. The model demonstrates commendable efficacy in proactively predicting misinformation dissemination in online communities. Moreover, the attention scores of each path illuminate the varying influences of different echo chambers on the misinformation propagation trajectory.
In essay 3, we model the hierarchical graph structures inherent to online discussion threads. We focus particularly on bot approval processes on Wikipedia, characterized by dialogues between bot operators and the bot approval group, with the outcomes determining the bot’s operational viability. While current literature has extensively scrutinized online bot detection and roles, the origination of bots remains a less-trodden avenue. Preliminary experiments suggest that when juxtaposed with models relying solely on plain text, the inclusion of discussion structure considerably enhances model efficacy in predicting approval outcomes.
Academic Advisor
Jingyi Sun