Labs, Problem Sets, and data for LEGALST-190: Data, Prediction, and Law.
Data, Prediction, and Law is a new Legal Studies seminar that allows students to explore different data sources that scholars and government officials use to make generalizations and predictions in the realm of law. The course will also introduce critiques of predictive techniques in law. Students will apply the statistical and Python programming skills from Foundations of Data Science to examine a traditional social science dataset, “big data” related to law, and legal text data.
Lab Date | Summary | Data | Interact Link |
---|---|---|---|
1-18-18 | Python, Jupyter, Tables | NCVS Incident-Record-Type | https://bit.ly/2sBzhhy |
1-23-18 | Table operations, Scatter Plots, Histograms, Probability | NCVS | https://bit.ly/2Jegu2Q |
1-25-18 | Empirical Distributions and Hypothesis Testing | NCVS | https://bit.ly/2LXzQeC |
1-30-18 | Bootstrap and Confidence Intervals | American National Elections Study (ANES) 2016 Election | https://bit.ly/2kMgtsA |
2-6-18 | Intro to Folium | us-states.json | https://bit.ly/2JaYjv3 |
2-8-18 | Folium: Choropleth Maps | us-states.json, US Unemployment October 2012 | https://bit.ly/2kOXbmt |
2-13-18 | Folium: Heat Maps | us-states.json, US Unemployment October 2012 | https://bit.ly/2JvtVie |
2-15-18 | Folium plugins: Search and Draw | us-states.json, US Unemployment October 2012 | https://bit.ly/2LUJQFw |
2-20-18 | Intro to Numpy and Scipy: numerical operations | None | https://bit.ly/2LiXxgd |
2-22-18 | Intro to Regression and the Test-Train-Validation Split | Bike Sharing | https://bit.ly/2xBC4NG |
2-27-18 | Model Selection and Cross Validation | Bike Sharing | https://bit.ly/2xDKFiF |
3-1-18 | Text Preprocessing : Stemming, Chunking, Tokenizing | UN General Assembly Transcripts | https://bit.ly/2sEdPZc |
3-6-18 | Introduction to Text Analysis : Document-Term Matrix | UN General Assembly Transcript | https://bit.ly/2J8nfre |
3-13-18 | Web Scraping and XML Parsing | Old Bailey Online Corpus | https://bit.ly/2LgDleW |
3-15-18 | Regular Expressions | Old Bailey Online Corpus | https://bit.ly/2sszsN4 |
3-20-18 | TF-IDF and Classification: Naive Bayes, Multinomial Logistic, SVM | Stack Exchange Queries | https://bit.ly/2syOyj2 |
3-22-18 | Exploratory Data Analysis: Feature Extraction, Visualizations, PCA | 2016 US Presidential Campaign Speeches | https://bit.ly/2kMh1ic |
4-3-18 | Neural Nets: Multi-Layered Perceptron, Convolutional Neural Netowkrs | MNIST | https://bit.ly/2J491aC |
4-5-18 | Word2Vec and Word Embeddings | UN General Debate Transcripts | https://bit.ly/2kJxHXn |
4-10-18 | Topic Modeling: Latent Dirichlet Analysis in Gensim and Scikit-learn | UN General Debate Transcripts | https://bit.ly/2xBEzQ4 |
4-12-18 | Text Analysis: Sentiment, Morality, Non-Negative Matric Factorization | Old Bailey Online Corpus | https://bit.ly/2JdMNPF |
4-17-18 | Feature Selection | Bike Sharing | https://bit.ly/2Hjso9U |
4-19-18 | Decision Trees and Ensemble Methods | Bike Sharing | https://bit.ly/2xDvSEV |