No data, no AI. Crap data, crap AI.
That is why we collect, clean and publish data for you, so you don't have to.
See how to download data from us for free with one click and how to get data directly to Python Pandas DataFrame in this short video:
The long version:
It is the data holding AI and data science back, not the science.
As reported by Forbes, Data Scientists persistently experience the preparation of data as the most time consuming part of their work.
We fix that.
To speed you up, we have collected, cleansed and standardized thousands datasets from tens of sources all over the world for you.
We've also handcrafted a number of geospatial data products for you that answer to what, where, when, why. Demographics, places, services, health, weather, traffic, air quality, forecasts, history... Tell us the location, we'll tell all about it literally with nanometer accuracy.
Need external data in your analysis? Go to our data catalogue, sign up and download datasets for free.
Internal data + external data = richer analytics!
Data catalogue with a dataset search. Automatically generated data audits and key figures for all columns of all datasets. Source data standardization and tagging, one click data samples with an interactive online data explorer for all datasets. Compatibility with the most commonly used tools and programming languages.
Data sourcing from different types of sources in a multitude of formats with automated data quality checks and type conversions. Data delivery with user access authorization via multiple channels and formats to support external data utilization for multiple user groups.
Open your valuable data asset to the public. Start monetizing and publishing your diverse data sources, whether private or open, via our well-defined, secure, industry standard APIs. Change your complex and expensive custom backend to our feature-rich and cost-efficient cloud data publishing service today.
To download datasets in csv format for free:
1. Sign in to our data store.
2. Search and find datasets of your interest.
3. Go to the dataset table page.
4. Click on the Download button.
(you must be logged in for the Download button to appear)
5. Double-click on the downloaded file to open it in Excel.
See video above for detailed guidance.
To get data directly to Python Pandas DataFrame:
1. Sign in to our data store.
2. Go to your profile page.
3. Find your API Key in bottom left corner, copy the key to notepad.
(the key format is 'xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxx')
4. Search and find datasets of your interest.
5. Go to the dataset table page and copy the page URL to notepad from the browser address bar.
(the url format is 'https://store.smartdatahub.io/dataset/…/resource/...')
6. Copy and paste the code snipplet below to your Python notebook or console, replacing <api_key> and <table_url> with your API Key and dataset table URL:
pip install --upgrade pip pip install sdhpy from sdhpy.pandas import SdhPandas sdh = SdhPandas(store_apikey="<api_key>") df = sdh.url("<table_url>").data df.head()
See video above or check examples in our GitHub for detailed guidance.
You can download datasets in CSV format or via our Python SDK for free by signing in to our data store. After sign in go to the dataset table page and click on the Download button to get a copy of the dataset. See instructions for data access.
Contact us for options for direct access to our data asset with your favorite analysis tools and APIs, or directly plugging our entire data asset into your enterprise data lake. Pricing from 199 per month.
Our CEO is a technology multi-talent and a pioneer in a number of next gen technologies such as computer vision, semantic analysis, parallel data processing, cloud computing and data networks. Mikko has an extensive experience in helping large private and public sector organizations in a broad range of technology applications.
Besides piloting our company, Mikko also flies real airplanes.
See more in LinkedIn
Karri is a seasoned technology entrepreneur and an expert in big data engineering, machine learning and cloud computing with profound understanding and strong hands-on expertise in next gen data architectures and advanced analytical methods. As a sought-after lecturer, trainer and advisor he has helped more than 30 large private and public sector organizations across different industries in data and analytics capability transformations.
In addition to parenting data architectures you can find Karri in his garden nurturing diverse species of nature.
See more in LinkedIn
Contact us for further information using any of the means below