CV - Raphael Zhu

604.834.0061 | Raphael.z98@outlook.com

Education

The University of British Columbia
Vancouver, BC

Master of Science: Major in Statistics
Sep 2020 - May 2022

International Tuition Award


The Ohio State University
Columbus, OH

Bachelor of Science: Honours Physics, Minor in Statistics and Mathematics
Aug 2016 - May 2020

Magna Cum Laude, James Smith Scholarship



Skills

  • Programming: Python, SQL, R, Pine Script, Java and C++; PowerShell, Zsh, Bash and Git.
  • Data Science: Snowflake, Azure, Databricks, Spark, Hive, Power BI and Mode Analytics.
  • Field Knowledge: Data Quality Assurance, Data Migration, Data Engineering, Machine Learning, Sales Growth Analysis, and Channel Incentive Optimization.

Experience

Budweiser APAC
Shanghai, CHN

Data Engineer / Data Scientist
Jul 2024 - Dec 2024

  • Data Engineering: Actively involved in middle-platform architecture design and multi-environment data governance. Rebuilt task dependencies across ODS-DW-ADS and effectively reduced intermediate layer redundancy. Enhanced Azure Data Factory ETL performance and improved Databricks cluster efficiency.
  • Data Science: Bridged sales business BI requirements, providing indicator data support for risk order inspections, performing sales and supply chain data preprocessing, optimizing DWS layer data modeling, assisting in feature extraction for Databricks ML models, and maintaining real-time data pipelines.
City of Burnaby
Vancouver, CAN

Data Analyst
Sep 2022 - Jun 2024

  • Data Science: Conducted in-depth analysis and delivered visualized reports through Azure Databricks. Supported municipal and community projects with information extraction and processing, performed comprehensive data analysis, and provided actionable strategic recommendations.
Yuyao Jewelry Co., Ltd.
Shanghai, CHN

Business Analyst & General Partner
Apr 2022 - Nov 2023

  • Constructed a Gemstone and Jewelry consultant service between customers and sourcing partners.
  • Built an international trading channel between Thailand, Sri Lanka, Pakistan and China.
  • Ran ecommerce stores on various platforms including Taobao and Douyin.
  • Promoted auction-grade hign-end jewelry from Thailand to Mainland China.
LucenFly
Vancouver, CAN

Co-Founder
May 2022 - Oct 2023

  • SquareSpace-powered website design, development and maintainance.
  • Gemstone/jewelry high-res photography.
  • Design and construct product database and SKU system.
  • Maintain and expand supply chain relationships.
Pacific Cloud Trading Inc.
Vancouver, CAN

General Partner
May 2022 - Present

  • Founded custom jewelry brand 'LucenFly'.
  • Conduct marketing analysis and draft sales strategy.
  • Perform internal audits including expenditure controlling, balance sheet reporting and growth monitoring.
Lightyear Health, Team of Data Science
Walnut Creek, USA

Data Scientist Co-op
Aug 2021 - Jan 2022

  • Data Migration: Migrated data to Snowflake data warehouse from HubSpot data lake. Created 30+ data schemas, and monitored the data mapping process.
  • Quality Assurance: Designed, implemented and maintained 10+ quality assurance (QA) scripts to detect anomalies and ensure the integrity of clinical and sales data.
  • Database Manipulation: Conducted data profiling, transformation, and restructuring with complex Snowflake SQL query and Python pipeline scripts.
  • Reporting and Dashboard: Generated and maintained 10+ periodic dashboards to visualize sales growth, identify investments and inform internal decision-making.
  • Data Analysis and Modeling: Optimized regional sales strategy and built a predictive diagnosis system with advanced machine learning models.
University of British Columbia, Department of Statistics
Vancouver, BC

Graduate Research Assistant
Sep 2021 - May 2022

  • Project Content: R packages "Distplyr" and "Distionary" development with Professor Vincenzo Coia.
  • Software Development: Provided 10+ infrastructure functions and test suites to aid software validation and deployment.
  • Program Documentation: Created 10+ informative demonstration examples in the documentation files to enhance software users' coding experience and facilitate future software maintenance.
  • Presentation and Knowledge-Sharing: Host department-wise seminar to share knowledge of statistical software development techniques and real-world applications.
University of British Columbia, Department of Statistics
Vancouver, BC

Graduate Academic Assistant
May 2021 - Aug 2021

  • Support department's online teaching requirements and developments of learning resources for STAT courses.
  • Apply innovative learning technologies (Canvas, WeBWorK, R) to traditional in-person classes.
  • Provide reviews on course material and develop new questions for exams.
University of British Columbia, Department of Statistics
Vancouver, BC

STCS Consultant
Apr 2021 - May 2022

  • Offer professional statistical assistance in short term for experiment design for research.
  • Draft project planning and schedule for clients with personalized requirements.
  • Provide explanations of statistical techniques, assistance in study designing and analysis of client's data set.
The Ohio State University, Department of Physics
Columbus, Ohio

Physics Tutor
Sep 2017 - Apr 2020

  • Helped students understand concepts in Physics classes and inspired them to solve relative questions.
  • Provided brief exam review sessions for groups of students who were attending Physics classes.
The Ohio State University, Holography Club
Columbus, Ohio

Membership and Holography Maker
Aug 2017 - Apr 2020

  • Produced small sized holography works.
  • Prepared and co-host the Holography Show.


Activities

Meta Signal | Backtrader, Shanghai
Sep 2024 - Present

  • Implemented multiple indicators and alphas for Backtrader framework workflow. Constructed a signal system based on tree models. Provided scripts for automated local MySQL database construction and online data retrieval.
Meta Divergence | TradingView, Vancouver
Jan 2024 - Jan 2025

  • An improved indicator supporting versatile divergences and thresholds for alerts. Python version supports machine learning optimization.
Chartist RaphaelZ | YouTube, Vancouver
Apr 2024

  • Provided basic introduction to financial investments.
  • Produced educational content for risk management, trading psychology, and trading strategy.
  • Illustrated the plan and execution of strategies.
Crypto Candlestick Patterns | TradingView, Vancouver
Nov 2023

  • A crypto market version of traditional candlestick patterns, with Chinese language option.
  • Improved the original "All Candlestick Patterns" indicator in signal illustration.
  • Added more options for adjusting recognization's precision.
Distplyr and Distionary | R Packages, Vancouver
Apr 2022

  • Distplyr: provides a grammar for manipulating the probability distributions under a clean and user-friendly framework.
  • Distionary: provides distribution types as dst_*() family of functions and provides the basic objects to be manipulated under distplyr.
  • Presented and introduced the two packages to public.
NLP Project on Sentence Similarity | UBC, Vancouver
Nov 2021

  • Cleaned up and preprocessing data with 400k+ entries from Kaggle's competition "Quora Question Pairs".
  • Developed an 85%-accuracy LSTM with Word2Vec model for question pair classification based on sentence similarity.
  • Researched on 1D-CNN embedding layer with class weights, TFIDF, and mover distance etc., 86% accuracy achieved.
Apr 2021

  • Exploratory data analysis and time series processing.
  • Developed a Bayesian structural time series model for seasonal prediction on S&P 500 index with 84% accuracy.
  • Built Gaussian process models in python for monthly prediction on S&P 500 index.
Mar 2021

  • Applied the survival analysis on the conversion rate of first purchasing of new users.
  • Constructed a Cox PH model and an Aalen's Additive regression model to recognize the vital factors on users' behavior.
  • Provided marketing strategy based on the key reported factors for specific targeted customers and markets.
Mar 2021

  • Collaborated in team and developed a longitudinal study focusing on the patients' evaluation of surgery outcomes.
  • Drafted exploratory analysis and univariate analysis, using in-depth visualization and sufficient A/B tests.
  • Preprocessed and restructured raw data with 1.5k+ entries for logistic regression and linear mixed effect models.
Nov 2020

  • Presented exploratory data analysis and missing values analysis on the hospital measurements of HCV patients.
  • Built OvR/MvM multiclass logistic-regression-based classifiers with 95.4/96.4% accuracy for the patient recognition.
  • Implemented a Naive Bayes Classifier with 93.81% accuracy as the benchmark for the multiclass classifier.
Materials and Sky | OSU, Columbus
Apr 2020

  • A photographic project of a series with diagonal layout.
  • Explore the collision between building materials and the sky.
  • Explore the intersection between geometric structures and natural views.
Data I/O | OSU, Columbus
Oct 2019

  • Completed the challenge of traffic analysis from Mobikit, for improving the safety function in auto driving.
  • Conducted modification on nominal data, data cleaning and missing values analysis.
  • Constructed logistic regression modelling and performed model selection.
DataFest | OSU, Columbus
Apr 2019

  • Data modification, feature selection, exploratory data analysis and multiple hypothesis test with high power.
  • Presented insights of the data and interpreted data visualization for non-academic audience.
  • Offered specific suggestions regarding daily lifestyles of university athletes.
Nov 2016

  • Tested complied codes and maintained coding consistence.
  • Designed an educational language based on JavaScript which could be compiled to low level subsets of JS and C.
  • Kept the language as strictly typed, multi-paradigm, and compiled with modern and clean style.