- Intake-cmip, a plugin for reading CMIP5 and CMIP6 data using intake.
- Esmlab, tools for working with Earth system multi-model analyses with xarray.
Software Engineer, National Center for Atmospheric Research
October 2018 - Present | Boulder, CO
- Working on the Pangeo team.
Software Developer Intern, Quansight
June 2018 - September 2018 (3 months)
Implemented datetime and categorical accessors for dask-cudf, a partitioned GPU-backed dataframe built on Dask.
Contributed to xnd, a project that refactors NumPy capabilities into low-level libraries and high-level interfaces.
Contributed to cudf, a Python interface for accessing and manipulating the GPU DataFrame, using the Apache Arrow library.
Data Scientist Intern, First Orion
December 2017 - May 2018 (6 months) | Little Rock, AR
Machine Learning: Designed and built scoring and predictive models with scikit-learn using First Orion’s proprietary telecommunication data.
Data Processing: Identified patterns and characteristics within First Orion’s data warehouses using Dask, Apache Spark, and pandas.
Research Intern, National Center for Atmospheric Research
May 2017 - August 2017 (3 months) | Boulder, CO
Installation: Installed Apache Spark on both the Cheyenne and Yellowstone supercomputers.
Schedulers: Cleaned up and fixed the Bash scripts that launch Spark under the LSF and PBS schedulers.
spark-xarray: Wrote spark-xarray, a Python package that integrates PySpark and xarray for climate data analysis.
Jupyter notebooks: Contributed Jupyter notebooks and scripts that use Apache Spark to NCAR’s Coupled Model Intercomparison Project (CMIP) Analysis Platform.
Documentation: Documented research work at https://ncar.github.io/PySpark4Climate/
University of Arkansas at Little Rock
(2014 - 2018) | Little Rock, AR
Bachelor of Science, Systems Engineering
Languages & Frameworks
- Apache Spark
- Google Cloud Platform