Career & Research Impact

Presentations & Conferences

(Highlight talks, conference papers, invited presentations)

  • Presented Optimizing Large Language Models (LLMs) for ICU Readmission Prediction, for Underrepresented Populations at Bristol Myers Squibb (BMS) Science Scholars Symposium (2024/25). project-presentation

    • Developed large-scale AI models leveraging NLP to improve predictive analytics for underrepresented populations in healthcare decision-making.

    • Led an industry-focused AI training session at BMS, introducing prompt engineering techniques for AI-driven healthcare analytics.

  • Presented Automated Regulatory Classification of Mobile Medical Apps at IEEE 38th CBMS Conference in Madrid, Spain (June 2025), showcasing NLP pipelines for detecting compliance issues in mobile apps. certificate-of-participation

  • Presented Automated Regulatory Classification of Mobile Medical Apps at Montclair State University symposium, highlighting AI-driven compliance frameworks for healthcare applications. (2024/25, with Prof. Raina Samuel) project-poster

  • Presented ML-GIS Research on Solar Panel Detection & Energy Estimation at New Jersey Big Data Alliance Symposium. (March 2024) project-poster

  • Presented Presented Findings on Virtual Reality in STEM Education at Clean Energy and Sustainability Analytics Center (CESAC) Symposium. (October 2023)


AI & Machine Learning Research

(research, methodologies, impact)

🔹 Graph Neural Networks & AI for Drug Discovery (Data Science Lab - AMIA 2025 Conference) paper

  • Used biomedical knowledge graphs & Graph Neural Networks (GNNs) to improve drug-drug interaction prediction.

  • Integrated biomedical ontologies & structured pharma datasets to enhance AI-driven clinical research.

🔹 Automated Regulatory Classification of Mobile Medical Apps (School of Computing, IEEE - CBMS Conference 2025 Paper Acceptance) paper

  • Built an NLP-based compliance framework that classifies mobile medical apps based on FDA & FTC regulatory risk, improving oversight of AI-driven health applications..

  • Integrated spatially aware feature representations to enhance generalization in ML models.

  • Defined a novel mathematical framework for integrating domain knowledge into ML pipelines, optimizing model explainability.

🔹 Machine Learning & GIS for Environmental Policy (EPA & CESAC Research, 2023) project-poster

  • Built ML & GIS models to analyze Brownfield funding allocation & clustering patterns.

  • Applied Random Forest (44% variance explained), Moran’s I, and Spatial Error Models (SEM) to study geographic disparities in environmental funding.

🔹 Virtual Reality & AI for STEM Education (Montclair State University – Virtual Reality for Education Lab, 2023) website

  • Contributed to “The Human Brain Time”, a fully immersive VR application for neuroanatomy education, now deployed on Meta Quest App Lab, expanding access to interactive STEM learning.
    Career Experience - Applied Research & Data Science

Career Experience - Applied Research & Data Science

(Industry work, and real-world applications of AI)

🔹 Applied Machine Learning Researcher, Montclair State University (2024-Present) - Hao Liu Data Science Lab website

  • Improved diagnostic predictions by developing a multimodal healthcare analytics pipeline that integrated EHR records with clinical notes using NLP, and data mining techniques; featured on MSU’s Data Science Lab website.

  • Enabled scalable analysis and model deployment by structuring healthcare data pipelines across hybrid warehouse/lake architectures using SQL, NoSQL, and JSON

  • Accelerated data transformation and insight generation by orchestrating distributed processing workflows in Spark and Snowflake.

  • Achieved 94% accuracy in comorbidity risk classification using interpretable ML models (SHAP, LIME), supporting transparent, cross-functional decision-making for clinical stakeholders.

🔹 Graduate Researcher - School of Computing, Montclair State University (2023-Present)

  • Built automated data pipelines using Python and Airflow to ingest 1,400+ medical app descriptions and over 220,000 user reviews, powering large-scale regulatory compliance analysis.

  • Applied Spark-based classification workflows with Scala to process high-volume data; implemented drift detection scripts to track schema and metadata shifts as app ecosystems evolved.

  • Achieved 85% accuracy in app compliance prediction under HIPAA, FTC, and FDA standards by training NLP models with BERT, spaCy, and NLTK, reducing manual audit workload.

  • Enhanced monitoring precision by 30% through user sentiment analysis with VADER and TextBlob, surfacing latent regulatory red flags.

  • Integrated Snowflake and AWS Glue (including Glue Data Catalog) to automate metadata tagging and streamline schema tracking across classification runs.

🔹 Hu-Au Virtual Reality Lab (2023-2024) website

  • Guided students in VR development & AI integration, helping them understand 3D modeling, interactive simulation design, and AI-driven learning environments.

  • Led technical training sessions on Unity, C#, and Python for VR applications, bridging AI & virtual learning technology.

  • Collaborated with faculty to expand research initiatives, ensuring projects align with educational technology advancements and funding opportunities.

🔹 Clean Energy Sustainability and Analytics Center (2023-2024)

  • Mentored undergraduate students & new research assistants, guiding them through GIS data preprocessing, ML model development, and environmental policy applications.

  • Led team discussions & technical workshops on spatial machine learning, geostatistical modeling, and energy forecasting techniques.

  • Assisted in writing & reviewing research proposals, ensuring projects aligned with policy impact and scientific rigor.

🔹 Bristol Myers Squibb Science Scholars (2024-Present)

  • Led research on open language models for predictive healthcare analytics, mentoring students and collaborating with industry professionals.

  • Presented research findings to BMS leadership, healthcare researchers, and data science professionals.

🔹 Data Analyst - Port Authority of New York & New Jersey (2024)

  • Automated analytics for 250K+ EV charging records by developing robust ETL workflows in Python and SQL, integrating MySQL data with AWS S3 and Redshift for daily operational reporting.

  • Forecasted charging demand and detected anomaly-driven downtime spikes (>20%) using predictive models built with PySpark, informing proactive maintenance planning.

  • Developed interactive dashboards in Tableau visualizing station uptime, fault trends, and site-level reliability, used by infrastructure teams for performance reviews and supporting business strategies.

  • Improved pipeline robustness by 30% through SQL tuning and data validation checks that flagged missing values, schema drift, and integrity issues.

  • Supported scalable, event-driven reporting aligned with core business strategies by integrating REST APIs with AWS Lambda and Glue as the EV network expanded across New York and New Jersey operations.

🔹 Data Analyst - Dozie & Dozie’s Pharmaceuticals (2019-2023)

  • Analyzed 100K+ patient records using SQL to uncover patterns in treatment outcomes, regional disease trends, and demographics, informing outreach and strategy across 30+ partner institutions.

  • Recovered ~$1.3M in unpaid claims (~85% recovery) by deploying statistical recovery models in Python and visualizing receivables through dynamic Power BI dashboards hosted on AWS.

  • Designed and implemented claims reporting frameworks that improved transparency and process improvement across healthcare partners, driving a 27% increase in revenue while maintaining regulatory compliance.

  • Delivered actionable insights to executive and partner teams, aligning financial recovery, patient outcomes, and public health priorities into one data-driven strategy.


Awards & Achievements

(Competitive recognition, honor societies)

  • Winner, RAISE 2025 – "Our Future With AI: Utopian or Dystopian?" (AI ethics & policy challenge.) LinkedIn post

  • Alpha Epsilon Lambda Honor Society (Top 5% of Graduate Students at Montclair State University.)

  • President & Founder, NextGen Tech Thinkers Club (MSU), a student-led initiative fostering AI, data science, and emerging technologies discussions.

  • Featured, MSU Communications for exemplary internship and research

  • Tri-Alpha Honor Society (Delta Chi Chapter, MSU)