
Sample code and other artifacts for data science and analytics at the UCSF Library


ucsf-ckm/dsi


Data Science Initiative

Sample code and other artifacts for data science and analytics at the CKM

Overview

This repository contains code samples from Programming and Pizza sessions, consulting sessions, workshops, and other events offered through the UCSF Library's Data Science Initiative.

In-class Python exercises for a workshop on using SQL and pandas together for data wrangling.

Using SQL with dataframes to select the rows where the maximum and minimum values occur when multiple rows contain a measurement.
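As a rough illustration of the idea (not the notebook's actual code), the sketch below loads a dataframe into an in-memory SQLite database and uses a grouped subquery to keep only the rows holding each group's maximum or minimum value. The table name `measurements`, the column names, and the sample values are all hypothetical.

```python
import sqlite3

import pandas as pd

# Hypothetical data: several measurement rows per subject.
df = pd.DataFrame({
    "subject": ["a", "a", "a", "b", "b"],
    "day": [1, 2, 3, 1, 2],
    "value": [5.0, 9.0, 2.0, 7.0, 4.0],
})

# Copy the dataframe into an in-memory SQLite table so it can be queried with SQL.
conn = sqlite3.connect(":memory:")
df.to_sql("measurements", conn, index=False)

# Join each row to its subject's max/min and keep only the rows that match either.
query = """
SELECT m.subject, m.day, m.value,
       CASE WHEN m.value = s.max_value THEN 'max' ELSE 'min' END AS kind
FROM measurements AS m
JOIN (
    SELECT subject, MAX(value) AS max_value, MIN(value) AS min_value
    FROM measurements
    GROUP BY subject
) AS s ON m.subject = s.subject
WHERE m.value IN (s.max_value, s.min_value)
ORDER BY m.subject, m.day
"""
extremes = pd.read_sql_query(query, conn)
conn.close()

print(extremes)
```

If a group's maximum or minimum occurs on several rows, this query returns all of them, which may or may not be what a given analysis wants.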

PowerPoint slides and supporting files for a workshop on database creation and normalization.

Python notebook covering field input, basic logic, assignment, control flow, and charting for the Now Playing event on April 12, 2017.

Python notebooks that use Python, SQL, and pandas dataframes to make a genomic spreadsheet easier to query and analyze. Blog write-up available at https://blogs.library.ucsf.edu/ckm/2016/09/27/data-munging-with-python-sql-and-excel/.

Python notebook analyzing the 3v2 dice roll in the board game Risk, with both a simulation and a full calculation. Part of the Pi Day events on March 14, 2017, at the UCSF Library.
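A minimal sketch of both approaches follows. It assumes the standard Risk rules (the attacker's two highest dice are compared against the defender's two dice in descending order, and the defender wins ties) and is not taken from the notebook itself. The function names are hypothetical.

```python
import random
from collections import Counter
from itertools import product


def defender_losses(attack, defend):
    """Return how many armies the defender loses in one 3v2 roll."""
    top_attack = sorted(attack, reverse=True)[:2]
    top_defend = sorted(defend, reverse=True)
    # The defender wins ties, so the attacker needs a strictly higher die.
    return sum(1 for a, d in zip(top_attack, top_defend) if a > d)


def exact_counts():
    """Full calculation: enumerate all 6**5 equally likely outcomes."""
    counts = Counter()
    for roll in product(range(1, 7), repeat=5):
        counts[defender_losses(roll[:3], roll[3:])] += 1
    return counts


def simulate(trials, seed=0):
    """Simulation: estimate the same distribution from random rolls."""
    rng = random.Random(seed)
    counts = Counter()
    for _ in range(trials):
        attack = [rng.randint(1, 6) for _ in range(3)]
        defend = [rng.randint(1, 6) for _ in range(2)]
        counts[defender_losses(attack, defend)] += 1
    return counts


exact = exact_counts()
total = sum(exact.values())
simulated = simulate(10_000)
for losses in (2, 1, 0):
    print(
        f"defender loses {losses}: "
        f"exact {exact[losses] / total:.4f}, "
        f"simulated {simulated[losses] / 10_000:.4f}"
    )
```

The enumeration covers every possible roll, so it gives exact probabilities, while the simulated frequencies should approach those values as the number of trials grows.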

Python notebook files for an intro to Python workshop: assignment, variables, loops, conditionals, and methods.

Python notebook using BeautifulSoup to parse and collect researcher information from the Profiles API.

Python notebook to illustrate a row-by-row calculation applied to a pandas dataframe.
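One common way to do this is `DataFrame.apply` with `axis=1`, which passes each row to a function as a Series. The sketch below is an illustration under assumed data, not the notebook's own example. The column names and the discount rule are hypothetical.

```python
import pandas as pd

# Hypothetical order data.
df = pd.DataFrame({
    "quantity": [2, 10, 5],
    "unit_price": [3.0, 1.5, 4.0],
})


def row_total(row):
    """Compute one row's total, with a 10% discount for 10 or more items."""
    total = row["quantity"] * row["unit_price"]
    if row["quantity"] >= 10:
        total *= 0.9
    return total


# axis=1 calls row_total once per row and collects the results in a new column.
df["total"] = df.apply(row_total, axis=1)

print(df)
```

Row-wise `apply` is easy to read but slow on large dataframes. When the logic can be written with whole-column operations, those are usually faster.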

Python notebook using pandas and SQL to analyze an activity log.

Python notebook using SQL and pandas to illustrate 1) converting columns to rows in a SQL table, and 2) joining multiple tables with varying matching columns.
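The two ideas can be sketched with Python's built-in `sqlite3` module. This is an assumption-laden illustration rather than the notebook's code: the tables `scores` and `students`, their columns, and the data are hypothetical, and "varying matching columns" is read here as join keys that have different names in each table. Columns are turned into rows with `UNION ALL`, and the result is then joined to a lookup table.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE scores (student_id INTEGER, math INTEGER, reading INTEGER);
INSERT INTO scores VALUES (1, 90, 80), (2, 70, 85);
CREATE TABLE students (id INTEGER, name TEXT);
INSERT INTO students VALUES (1, 'Ana'), (2, 'Ben');
""")

query = """
SELECT st.name, unpivoted.subject, unpivoted.score
FROM (
    -- 1) Columns to rows: one SELECT per column, stacked with UNION ALL.
    SELECT student_id, 'math' AS subject, math AS score FROM scores
    UNION ALL
    SELECT student_id, 'reading' AS subject, reading AS score FROM scores
) AS unpivoted
-- 2) Join on key columns that are named differently in each table.
JOIN students AS st ON st.id = unpivoted.student_id
ORDER BY st.name, unpivoted.subject
"""
rows = conn.execute(query).fetchall()
conn.close()

for row in rows:
    print(row)
```

The same query could be run through `pandas.read_sql_query` to get the result back as a dataframe.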

Python notebook illustrating how to use a Python script to process health issues at geographical locations. Input and output files are in Excel.
