The gathering stage entails buying the required knowledge with a view to carry out a significant evaluation primarily based upon correct info.
Strategies
Knowledge Necessities
Outline which knowledge is required to correctly strategy the challenge (e.g. format, variables, time vary, granularity)
Knowledge Sources
Discover dependable and related knowledge sources (e.g. databases, APIs, information, sensor readings)
Authentication
Safe obligatory permissions to entry the information (e.g. electronic mail/password, OAuth, API key, robots.txt)
Assortment
Purchase the information utilizing acceptable strategies (e.g. SQL queries, API calls, internet scraping, handbook knowledge entry)
Knowledge Administration
Deal with the information in accordance with greatest practices (e.g. knowledge high quality, knowledge governance, knowledge safety)