HOME  |  PAPERS |  NOTES  |  BLOG |  DATA |  SOFTWARE               

SOFTWARE
  1. indicate: transliterate indic languages to english
    Software

  2. notnews: predict the type of news based on story text and URL
    Software

  3. Infer Race and Ethnicity From Names:

  4. Infer Gender From Names:

  5. Search a long list of names (patterns) in a large text corpus systematically and quickly
    Software

  6. Categorize the Content of Domains:

  7. Lost Years: Expected Number of Years Lost
    Python Package

  8. pysum: summarize pandas dataframe
    Software

  9. Know Your IP
    Python Package | Application

  10. Highlight Citations to Retracted Articles
    Website | Code

  11. AutoSum: Summarize Publications Automatically and Discover Miscitations
    Software

  12. Adjust Naive Estimates of Learning for Guessing
    R package | Related Paper

  13. Get Weather Data:
    Please read this before downloading any of the following scripts.

    • Find nearest zip codes given a list of weather stations (COOP and GHCND) via
      GeoNames: Data & Scripts
    • Find nearest weather stations given a list of zip codes: Data & Scripts
    • Get data from the nearest weather station given a list of zip codes and date range
      Script
    • Get data from the nearest weather station given a list of zip codes and date range
      using the NOAA web-service:  Script

  14. Image to Text:
    Please read this before downloading any of the following scripts.

  15. Edit Distance Based Search and Replace
    Software | Related Note

  16. Text as Data:

    • Normalize text, remove stop words, punctuation, numbers, stem, lemmatize
      Script
    • Subset, Randomly Sample, Summarize: Script
    • Create TDM with various weighting schemes: Script
    • Sentiment Analysis: Script
    • Supervised Learning: Classification, Regression

  17. Clarifai: Understand (Moving) Images
    R package | Analysis of Politicians' Instagrams | Infer Gender Based on First Name

  18. tuber: Access YouTube from R
    R package
    REVIEW: 'Thank you very much for the package ... it has made my life easy ....'

  19. tubern: R Client for the YouTube Analytics and Reporting API
    R package

  20. virustotal: R Client for the Virustotal Public API 2.0
    R package

  21. aws.alexa: Access Amazon Alexa from R
    R package

  22. Collecting Data from the Streets:

  23. Collecting, Parsing, and Processing Indian Electoral Rolls:

    • Collecting Indian Electoral Rolls
      Python scripts

    • Elector Count: Estimate the Total Number of Electors in a State
      Python script

    • Table Translator: Use Google Translate API to Get Word Level Translations And Append Translated Cell Values Back
      Python script

  24. countpy.com: counting more than downloads
    website

  25. incline: Estimate Trend at a Particular Point in Time in a Noisy Time Series
    Python Package