Welcome to the Q4C (QForce) Series

Queens For Computing
Queens College CUNY Computer Science Colloquium


This colloquium is intended to bring together Computer Science and Data Science researchers in the tri-state area (especially in NYC) and to foster collaboration. We welcome talks on any topic of interest to the CS community, including theory, algorithms, machine learning, and data science. If you are interested in attending in-person or online, or would like to give a talk, please contact the organizers.


  1. Monday, 09/15/2025, 12:15PM - 1:30PM
    Room: Science Building, C205
    Speaker: Zining Zhu, Stevens Institute of Technology

    Title: Improving LLM reasoning with mechanistic interpretability insights

    Abstract: LLMs have shown great success in tasks requiring reasoning skills, and LLM-based agentic reasoning systems have been used in scenarios including explanation, coding, and problem-solving, among many others. Meanwhile, many areas for improvement have also been identified. In recent years, mechanistic interpretability has offered useful tools and intuitions for addressing them. In this talk, I introduce some of our recent attempts to apply these tools and intuitions to improving LLM reasoning, and share some exciting preliminary findings along this avenue.

    Bio: Zining is an assistant professor in the Department of Computer Science at Stevens Institute of Technology, where he directs the Explainable and Controllable AI lab. The lab’s research involves understanding the mechanisms and abilities of AI systems and incorporating those findings into controlling them. Zining looks forward to building safe, trustworthy agentic AIs that can assist humans in discovering knowledge and performing high-stakes tasks. Zining has received a paper award at NAACL.

  2. Monday, 09/29/2025, 12:15PM - 1:30PM
    Room: Science Building, C205
    Speaker: Hanfei Yu, Stevens Institute of Technology

    Title: Serverless Computing for AI Systems

    Abstract: Serverless computing is emerging as the next-generation computing paradigm, bridging HPC and cloud cyberinfrastructures with its ease of deployment, instant scalability, and pay-as-you-go pricing. As AI reshapes industries and academia, serverless computing is increasingly explored for large-scale AI training and inference. However, traditional serverless architectures are not optimized for AI workloads, introducing critical performance bottlenecks that hinder their direct applicability. Rethinking serverless computing from an algorithm-system co-design perspective is essential to unlocking its full potential for AI systems.

    In this talk, I will present two case studies—Stellaris and FineMoE—that demonstrate the necessity of algorithm-system co-design for serverless AI workloads. Stellaris introduces a generic asynchronous learning paradigm for distributed deep reinforcement learning (DRL) training, leveraging serverless computing to achieve higher efficiency and lower costs. FineMoE optimizes Mixture-of-Experts (MoE) serving through fine-grained expert offloading, significantly improving memory efficiency while maintaining low inference latency. I will also discuss key challenges and future directions in serverless computing for AI, highlighting opportunities for optimizing AI workloads across cloud and HPC environments.

    Bio: Hanfei Yu is a fifth-year Ph.D. student in the Department of Electrical and Computer Engineering at Stevens Institute of Technology, advised by Prof. Hao Wang. He received his M.S. in Computer Science and Systems from the University of Washington Tacoma and his B.S. in Electronic Engineering from Shanghai Jiao Tong University. Hanfei's research focuses on serverless computing, reinforcement learning systems, LLM serving, and large-scale AI/ML systems. His work aims to develop efficient serverless AI ecosystems that integrate cloud and HPC resources to optimize AI workloads. His research has explored AI/ML-driven techniques to enhance serverless computing efficiency and the design of optimized serverless infrastructures for AI training and inference. He was a Research Intern at Microsoft Azure Research Systems and the Microsoft M365 Systems Innovation Group. Hanfei's contributions have been recognized with the ACM SoCC'24 Best Paper Award and as an ACM/IEEE SC'24 Best Student Paper Finalist. He was also selected as one of the 2025 MLCommons ML and Systems Rising Stars.

  3. Monday, 10/06/2025, 12:15PM - 1:30PM
    Room: Science Building, C205
    Speaker: Jonathan Gryak, Queens College CUNY

    Title:

    Abstract:

  4. Monday, 10/20/2025, 12:15PM - 1:30PM
    Room: Science Building, C205
    Speaker: Peter Heller, Queens College CUNY

    Title:

    Abstract:

  5. Monday, 11/03/2025, 12:15PM - 1:30PM
    Room: Science Building, C205
    Speaker: Yifan Hu, Rutgers University

    Title: Contextual Stochastic Bilevel Optimization and Reinforcement Learning

    Abstract: We introduce contextual stochastic bilevel optimization (CSBO) - a stochastic bilevel optimization framework with the lower-level problem minimizing an expectation conditioned on contextual information and on the upper-level decision variable. We also assume that there may be multiple (or even infinitely many) followers at the lower level. CSBO encapsulates important applications such as meta-learning, personalized federated learning, end-to-end learning, and Wasserstein distributionally robust optimization with side information as special cases. Due to the contextual information at the lower level, existing single-loop methods for classical stochastic bilevel optimization are not applicable. We thus propose an efficient double-loop gradient method based on the Multilevel Monte-Carlo (MLMC) technique. When specialized to stochastic nonconvex optimization, the sample complexity of our method matches existing lower bounds. We further discuss contextual bilevel reinforcement learning, which finds various applications in model design, mechanism design, and preference optimization.
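
    The CSBO framework described in the abstract can be sketched as follows (the notation here is an illustrative assumption, not taken from the talk): the upper level chooses a decision $x$, while the lower-level problem minimizes an expectation conditioned on the context $\xi$ and on $x$:

    ```latex
    \min_{x \in \mathcal{X}} \;\; \mathbb{E}_{\xi}\!\left[ f\bigl(x,\, y^{*}(x,\xi),\, \xi\bigr) \right]
    \qquad \text{s.t.} \qquad
    y^{*}(x,\xi) \in \operatorname*{arg\,min}_{y} \; \mathbb{E}_{\eta \mid \xi}\!\left[ g(x, y, \eta) \right].
    ```

    Because the lower-level solution $y^{*}$ depends on the realized context $\xi$, there is effectively one follower per context value, which is why single-loop methods for classical (context-free) stochastic bilevel optimization do not directly apply.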

    Bio: Yifan Hu is an assistant professor in the Department of Statistics at Rutgers University (New Brunswick, NJ). He was a postdoctoral researcher at EPFL and ETH Zurich in Switzerland from 2022 to 2025. Prior to that, he obtained his PhD in Operations Research from the University of Illinois Urbana-Champaign in 2022. His research interests lie in optimal decision-making under uncertainty, at the intersection of optimization, statistics, and operations research. In particular, he studies problems arising from reinforcement learning, preference optimization, causal inference and discovery, and operations management, aiming to build new models and develop simple-to-implement methods with provable guarantees.

  6. Monday, 12/01/2025, 12:15PM - 1:30PM
    Room: Science Building, C205
    Speaker: Jia Xu, Stevens Institute of Technology

    Title:

    Abstract:




The seminar is organized by Jun Li.
Email Contact: jun.li@qc.cuny.edu