Programming is an integral part of data science. Among other things, it is widely held that someone who understands programming logic, loops, and functions has a better chance of becoming a successful data scientist. So what about people who never studied programming at school or college?
The list of tools
- RapidMiner (RM)
RM covers the entire life cycle of predictive modeling, from data preparation through model building to validation and deployment.
- DataRobot (DR)
DR is a highly automated machine learning platform built by some of the all-time best Kagglers, including Jeremy Achin, Thomas DeGodoy and Owen Zhang.
- BigML
BigML is another platform with ~Mn USD in funding. It provides a good GUI which takes the user through 6 steps.
- Google Cloud Prediction API
The Google Cloud Prediction API offers RESTful APIs for building machine learning models for Android applications.
- Paxata
Paxata is one of the few organizations that focus on data cleaning and preparation rather than on the machine learning or statistical modeling part.
- Trifacta
Trifacta is another startup focused on data preparation. It offers two products:
- Wrangler – free stand-alone software
- Wrangler Enterprise – a licensed professional version
- Narrative Science
Narrative Science is built on a distinctive idea: it generates automated reports from data.
- MLBase
MLBase is an open-source project developed by the AMP (Algorithms Machines People) Lab at the University of California, Berkeley.
- Weka
Weka is data mining software written in Java, developed by the Machine Learning Group at the University of Waikato, New Zealand.
- Automatic Statistician
Automatic Statistician is not a product per se but a research organization that is creating a data exploration and analysis tool.
Please note: This article is for information purposes only and Data Futures Hub does not endorse any specific program or company.