< Research and Development

Referenced Materials

Background Research

  • https://www.informatik.hu-berlin.de/de/forschung/gebiete/ki/mac/lehre/lehrmaterial/Informationsintegration/Rahm00.pdf
  • http://betterevaluation.org/sites/default/files/data_cleaning.pdf
  • http://db.cs.berkeley.edu/jmh/papers/cleaning-unece.pdf
  • http://idlewords.com/talks/haunted_by_data.htm
  • http://blog.kaggle.com/category/dojo/
  • http://docs.aws.amazon.com/machine-learning/latest/dg/data-insights.html
  • https://github.com/OpenRefine/OpenRefine/wiki
  • http://blogs.msdn.com/b/bluewatersql/archive/2014/10/01/azuremachinelearningdata-preparation.aspx
  • http://blog.efpsa.org/2015/09/01/introducing-jasp-a-free-and-intuitive-statistics-software-that-might-finally-replace-spss/
  • https://zeppelin.incubator.apache.org/
  • http://priceonomics.com/should-you-ever-use-a-pie-chart/

Technical Development

  • http://pandas.pydata.org/
  • http://stackoverflow.com/questions/26244309/how-to-analyze-all-duplicate-entries-in-this-pandas-dataframe
  • https://scotch.io/tutorials/angular-routing-using-ui-router
  • http://sebastianraschka.com/Articles/2014_about_feature_scaling.html
  • http://ipython.org/