Damon Feldman, a solutions director at MarkLogic, discusses the benefits of integrating a data hub and a data lake. A data lake is a vast pool of raw data, while a data hub acts as a central point for data integration, allowing for the management and organisation of data. The combination of these two systems can offer a comprehensive solution for data storage and management.
Feldman highlights that a data lake is beneficial for storing large amounts of data, but can be challenging to navigate without proper organisation. On the other hand, a data hub can efficiently manage and organise data, but may struggle with large data volumes. Therefore, combining a data hub and a data lake can offer the best of both worlds, providing a system that can handle vast data quantities while maintaining organisation and accessibility.
The integration of these two systems allows for better data governance, as data can be tracked and managed more effectively. Additionally, it offers improved data security, as the data hub can provide secure access controls. Lastly, this combination enables quicker data access and analysis, as data can be easily located and processed.
Feldman concludes by stating that a combined data hub and lake system can offer a holistic approach to data management, addressing the challenges of data volume, variety, and velocity.
Go to source article: https://www.oreilly.com/ideas/damon-feldman-on-combining-a-data-hub-and-data-lake?imm_mid=0f769c&cmp=em-data-na-na-newsltr_20171025