Participated in a Phase III RCT evaluating progression-free survival for an immuno-oncology drug, contributing to statistical programming and primary efficacy analysis.
Built ADaM datasets from raw SDTM using SAS 9.4, ensuring CDISC-compliant traceability and alignment with SAP specifications.
Developed Cox proportional hazards models using PROC PHREG and conducted subgroup KM plots via R (survival, ggplot2) for clinical insight.
Produced validated TLFs to support FDA NDA submission, including stratified efficacy listings and interactive safety tables.
Assisted in reviewing the Statistical Analysis Plan, refining censoring rules and sensitivity analysis scenarios based on feedback from senior biostatisticians.
Collaborated with the data management team to resolve queries and support database lock.
Delivered documentation and submission-ready programs via Git-based version control, reducing code review time by 30%.
Presented key results and risk interpretation during internal regulatory meetings, supporting final CSR decision making.
Developed a cost prediction model using Python (pandas, scikit-learn) to analyze Medicare Advantage claims and support premium pricing strategies.
Engineered 150+ features capturing utilization patterns, chronic conditions, and demographics using SQL (Snowflake) and Python.
Trained and tuned GLM, Ridge, and XGBoost models to optimize RMSE, improving R² by 18% over baseline actuarial models.
Used SHAP (Python) to interpret model outputs and provide explainability to pricing and actuarial teams.
Integrated scoring pipelines into Snowflake and deployed batch runs using Airflow + Python operators.
Presented insights with model dashboards built in Tableau, and supported actuarial modeling decisions for the next cycle.
Led statistical analysis for post-market surveillance of a respiratory vaccine, integrating EHR + VAERS data to track safety signals in near real-time.
Cleaned and processed over 10 million records using SQL (PostgreSQL) and pandas, building structured datasets for signal detection.
Applied PRR and EBGM algorithms using custom R scripts to identify adverse event-vaccine pairs with statistically significant elevation.
Built a Shiny dashboard enabling pharmacovigilance teams to drill into event rates by region, age, and comorbidity profiles.
Automated safety metric updates using Airflow + Rscript, reducing manual reporting effort and ensuring consistency in weekly reviews.
Cross-validated internal findings with published studies using meta and forestplot packages to enhance medical credibility.
Supported regulatory reporting by contributing to two safety memorandums submitted to the FDA and EMA.
Generated reproducible analytics pipelines using R Markdown, which were later adopted by two other product safety teams.
Conducted an RWE study using Optum EHR to compare treatment outcomes between two oral diabetes therapies.
Extracted and filtered patient records using SQL + SQLAlchemy (Python) with custom inclusion/exclusion rules.
Applied propensity score matching (PSM) using R (MatchIt, tableone) to minimize bias from treatment selection.
Evaluated endpoints like HbA1c reduction, ER visit rates, and treatment discontinuation using Cox and logistic models.
Created stratified subgroup analyses across age, gender, and comorbidity level using survival and survminer.
Built plots and comparative tables with ggplot2 and exported to LaTeX for medical review.
Results were included in a publication draft submitted to JAMA Open and used by medical affairs to guide promotion strategy.
Demonstrated real-world effectiveness signal that supported expanded payer coverage in two regions.
Performed independent QC programming of efficacy/safety endpoints for solid tumor NDA using SAS Base/Macro + R.
Validated all ADaM datasets using Pinnacle21 and custom R scripts to confirm CDISC compliance.
Built traceability documentation linking derived variables to SAP-specified endpoints with dataset lineage.
Created macros and reusable scripts to standardize review across 40+ TLFs.
Flagged and corrected logic errors in adverse event derivation that would have delayed submission.
Assisted regulatory writing by providing footnote explanations, analysis rationale, and visual outputs.
Delivered clean QC package two weeks ahead of timeline, reducing regulatory audit preparation time by 30%.