indicate: transliterate indic languages to english
Software
notnews: predict the type of news based on story text and URL
Software
Infer Race and Ethnicity From Names:
Infer Gender From Names:
pranaam: Infer Religion From Indian Names
Python Package
Search a long list of names (patterns) in a large text corpus systematically and quickly
Software
Categorize the Content of Domains:
Know Your IP
Python Package | Application
Lost Years: Expected Number of Years Lost
Python Package
pysum: summarize pandas dataframe
Python Package
Highlight Citations to Retracted Articles
Code
AutoSum: Summarize Publications Automatically and Discover Miscitations
Software
Adjust Naive Estimates of Learning for Guessing
R package | Related Paper
Get Weather Data:
Please read this before downloading any of the following scripts.
- Find nearest zip codes given a list of weather stations (COOP and GHCND) via
GeoNames: Data & Scripts
- Find nearest weather stations given a list of zip codes: Data & Scripts
- Get data from the nearest weather station given a list of zip codes and date range
Script
- Get data from the nearest weather station given a list of zip codes and date range
using the NOAA web-service: Script
Image to Text:
Please read this before downloading any of the following scripts.
Edit Distance Based Search and Replace
Software | Related Note
Text as Data:
- Normalize text, remove stop words, punctuation, numbers, stem, lemmatize
Script
- Subset, Randomly Sample, Summarize: Script
- Create TDM with various weighting schemes: Script
- Sentiment Analysis: Script
- Supervised Learning: Classification, Regression
Clarifai: Understand (Moving) Images
R package | Analysis of Politicians' Instagrams | Infer Gender Based on First Name
tuber: Access YouTube from R
R package
REVIEW: 'Thank you very much for the package ... it has made my life easy ....'
tubern: R Client for the YouTube Analytics and Reporting API
R package
virustotal: R Client for the Virustotal Public API 2.0
R package
aws.alexa: Access Amazon Alexa from R
R package
Collecting Data from the Streets:
Collecting, Parsing, and Processing Indian Electoral Rolls:
- Collecting Indian Electoral Rolls
Python scripts
Elector Count: Estimate the Total Number of Electors in a State
Python script
Table Translator: Use Google Translate API to Get Word Level Translations And Append Translated Cell Values Back
Python script
countpy.com: counting more than downloads
website
incline: Estimate Trend at a Particular Point in Time in a Noisy Time Series
Python Package