My passion lies in harnessing the power of data to create better lives for all.

I live in San Fransisco, but I mostly breathe in Silicon Valley during weekdays. My expertise and interests include statistical machine learning, predictive modeling and visualizations, and product management.

I'm an advocate of data revolution for sustainable development. When I'm not playing with data, I enjoy running, painting, and spending quality time with loved ones. For more information, please contact me at moorissa.tjokro@columbia.edu.

Featured Work


American Journal of Lifestyle Medicine, 2018
Used statistical models to evaluate feasibility of Community-based, Supervised Exercise Programs (CSEP) for improving health of those suffering chronic medical conditions.

Machine Learning

Building Spotify’s “Discover Weekly” with Spark
Collaborative filtering algorithm in an audio recommendation system with MLlib & PySpark.
Developing a Matching Algorithm
Classification models, feature importances, & pairwise comparison in an entity resolution.
Direct Marketing Optimization using Mobile Data
Classification ensembles for improving growth forecasts in user subscriptions.
Leveraging Philanthropy Impacts with Data Mining
An overview of Beyond Profit Project sponsored by Bloomberg Philanthropies.
Meta-Learning for Credit Card Fraud Detection
Research study & presentation on fraud detection with bayesian net, knn, & decision trees.
Predicting NYC Renting Prices using Lasso Regression
Linear models for predicting next month's rent across New York neighborhoods.
Recommendation Systems for Purchase Data
An implementation of popularity & collaborative filtering models with Python & Turicreate.

Deep Learning

Improved Wasserstein Generative Adversarial Networks (GANs)
Building an improved version of Facebook's Deep Convolutional GANs implementation.
Starting out with Keras
Attempts in multilayer perceptrons and convolutional neural networks.

Natural Language Processing

Can a Chat a Day Keep the Doctor Away?
Building an end-to-end healthcare messenger bot using NLP and matching algorithm.
Exploring Trending Topic Bias in News vs. Social Media
An NLP-based analysis of topics on New York Times data versus Twitter streams.
Making Boston Safer using Natural Language Processing
A set of classification methods to predict text data & model semantic categories.
Topic Modeling for The New York Times News Dataset
Nonnegative Matrix Factorization (NMF) approach for classifying news topics.

Data Visualizations

Legacy of a Century: South Africa Today
Exploration of the nation's journey after the life of Nelson Mandela (please use Chrome).
Comparing Marvel and DC Superheroes
An attempt to settle the age-old fight with data and D3.
Exploratory Data Analysis & Visualization Resources
Site repository of visualization resources with Javascript, HTML, CSS, and SVG.
How does Trump's budget cut affect you?
Effects of Trump's billion-dollar cuts in city transportation with R, Carto, and Processing.
Ranking the Top 100 Sci-fi Books
Using D3 and Javascript to observe patterns.
Visualizations with R
Compilation of STAT GR5702 course assignments in descriptive statistics.
Visualizing the World's Poverty Rates
A quick attempt to visualize poverty rates using D3 and UN open source data.
What Makes Us Happy?
Comparison of happiness of countries around the world.


  • It's the possibility of having a dream come true that makes life interesting.

    Paulo Coelho
  • If I had asked people what they wanted, they would've said faster horses.

    Henry Ford
  • We make a living by what we get. We make a life by what we give.

    Winston Churchill


Columbia University

M.S. in Data Science Dec '17

Coursework: Machine Learning, Applied Machine Learning, Deep Learning & Neural Networks, Algorithms, Exploratory Data Analysis & Visualizations, Computer Systems, Bayesian Modeling, Storytelling with Data, Tech Entrepreneurship.

Georgia Institute of Technology

B.S. in Industrial Engineering with Statistics May '14
Graduated Summa Cum Laude

Relevant Coursework: Probability Theory, Statistical Inference and Modeling, Database Systems Design and Manipulation, Regression and Forecasting, Quality Control, Optimization, Reliability Engineering (graduate level), Stochastic and Queueing Theory.



Data Scientist Mar '18 - present

Built internal end-to-end products to support 10,000+ worldwide charging stalls DevOps & Infrastructure, where I led projects involving time series, machine learning (regression, classification, NLP), metrics development, and exploratory analyses of complex data.

NASA Goddard Institute for Space Studies

Machine Learning Intern Oct '17 - Jan '18

Constructed unsupervised clustering algorithms to assess ocean carbon cycle models and their atmospheric properties for ModelE climate simulations.

Columbia University

Graduate Teaching Assistant May '17 - Aug '17

Supervised the Applied Analytics capstone course (~140 graduate students) covering scenario modeling, data democratization, and information network mining in healthcare.

NBC Universal

Data Scientist Intern May '17 - Aug '17

Performed statistical inference, multivariate analyses, sampling, and clustering from high dimensional consumer data. Built and automated R&D tools using Spark & Python.

Target Marketeam, Inc.

Data Analyst Jul '14 - Jun '16

Managed data warehouses, developed linear models, and built decision support tools for nonprofit sectors. Selected as Lead Analyst and collaborated with the SVP of Analytics and the Head of IT to perform A/B testing and experimental design for direct mail products.

United Nations World Food Programme

Research Assistant Aug '13 - May '14

Constructed centralized hub models for Specialized Nutritious Foods with Dr. Nazzal and Spatial Risk Calendar team. Valuation resulted in 30% food shortage decrease in Zambia.

Georgia Institute of Technology

Computer Science & Statistics Teaching Assistant Dec '12 - Dec '13

Led weekly recitations, grade exams, and tutored students for Data Manipulation & Database Systems and Applied Statistics course (~650 students).

Technical Skills

  • Python, SK-Learn, Tensorflow, Keras, Spark
  • SQL, JavaScript, D3, HTML/CSS/SVG, Carto
  • R, Hadoop/MapReduce
  • SAS, JMP

Selected Honors

Columbia Annual Data Science
Hackathon, 1st Place Winner
Data Science Institute '17
Columbia Impact Hackathon,
1st Place Winner
Columbia Business School '16
Helen Grenga Nominee for Outstanding Woman Engineer
Georgia Institute of Technology '14
Rockwell Automation
Society of Women Engineers '13
Shannon & Wilson Technology Scholar
Shannon & Wilson, Inc. '11
International Leadership Award
International House New York '17
Toyota Scholarship
International House New York '16
President’s Undergraduate Research Award
Georgia Tech Research Institute '14
Faculty Honors
Georgia Institute of Technology '12
Dean's List
Georgia Institute of Technology '11-'14
Student Spotlight
Seattle Colleges Foundation '11

Fun Fact

I eat almost everything with spicy sauce, so yes, I'm that weird
and intense!