Teradata Open its Data Lake Management Strategy with Kylo: Literally

Still distilling good results from the acquisition of former consultancy company Think Big Analytics, Teradata, a powerhouse in the data management market took one step further to expand its data management stack and to make an interesting contribution to the open source community.

Fully developed by the team at Think Big Analytics, in March of 2017 the company launched Kylo –a full data lake management solution– but with an interesting twist: as a contribution to the open source community.

Offered as an open source project under the Apache 2.0 license Kylo is, according to Teradata, a new enterprise-ready data lake management platform that enables self-service data ingestion and preparation, as well the necessary functionality for managing metadata, governance and security.

One appealing aspect of Kylo is it was developed over an eight year period, as the result of number of internal projects with Fortune 1000 customers which has enabled Teradata to incorporate several best practices within Kylo. This way, Teradata has given the project the necessary maturity and testing under real production environments to launch a mature product.

Using some of the latest open source capabilities, including Apache Hadoop, Apache Spark and Apache NiFi, Kylo was designed by Teradata aiming to help organizations address common challenges of a data lake implementation and provide those common use cases the will enable reduced implementation cycles that average 6 to 12 months.

Teradata’s decision to release Kylo through an open source model instead of a traditional commercial one comes also within an interesting spinoff.

Traditionally a fully commercial software provider, the company has had in recent years a core transformation, being increasingly open to new business models and approaches, including its Teradata Everywhere strategy to enable increasing access to Teradata solutions and services in all possible on-premises and cloud platforms.

This broad strategy includes increased support for the open source community, such is the case with  the Hadoop community on different projects, Presto, and now of course with Kylo.

Teradata’s business model for Kylo is based the services its big data services company Think Big can offer on top of Kylo, these optional services include support training, as well as implementation and managed services.
According to Teradata, Kylo will enable organizations to address specific challenges implied within common data lake implementation efforts, including:

  • Shortage of skilled and experienced software engineers and administrators
  • Implementation of best practices regarding data lake governance
  • Reinforce data lake adoption beyond engineers and specific IT teams
Teradata aims with Kylo for a data lake platform that requires no code and enable self-service data ingest and preparation via an intuitive user interface to help accelerate the development process by enabling reusable templates to increase productivity.

From a functions and features perspective, Kylo has been designed to provide the necessary data management capabilities for the deployment of a data lake:

  • Data Ingestion. Self-service data ingest capabilities along with data cleansing, validation, and automatic profiling.
  • Data Preparation.  Handling data capabilities through a visual SQL and interactive data transformation  user interface.
  • Data Discovery. Data searching and exploration capabilities as well as metadata, view lineage, and profile statistics.
  • Data Monitoring. Data monitoring capabilities for health of feeds and services through the complete data lake as well as tracking service level agreements (SLA’s) and troubleshoot performance.
  • Data Pipeline Design. Capabilities for designing batch and/or streaming pipeline templates in Apache NiFi to be registered with Kylo, allowing user self-services.

As per words from Oliver Ratzesberger, Executive Vice President and Chief Product Officer at Teradata:
“Kylo is an exciting first in open source data lake management, and perfectly represents Teradata’s vision around big data, analytics, and open source software. Teradata has a rich history in the development of many open source projects, including Presto and Covalent. We know how commercial and open source should work together. So we engineer the best of both worlds, and we pioneer new approaches to open source software as part of our customer-choice strategy, improving the commercial and open source landscape for everyone.”

With Kylo, Teradata aspires to play a leadership role in the data lake, governance, and stewardship market, yet a difficult goal as niche vendors like Zaloni and Podium Data or big vendor like Informatica with its Data Lake Management solution stack but at first, it looks like a solution to follow closely, especially considering price point due to its business model versus the other commercial offerings.

Want more information?

Kylo software, documentation and tutorials can be found in the Kylo project website or at the project’s GitHub site, or check the following video and its page on Youtube:

Teradata Open its Data Lake Management Strategy with Kylo: Literally Teradata Open its Data Lake Management Strategy with Kylo: Literally Reviewed by Jorge Garcia on February 05, 2018 Rating: 5


  1. The appropriate response is yes. Information Science/Analytics making bunch employments in every one of the areas over the globe. data science course in pune


  2. Thank you so much for helping me out to find the Data science course in Mumbai
    and introducing reputed stalwarts in the industry dealing with data analyzing & assorting it in a structured and precise manner. Keep up the good work. Looking forward to view more from you.

  3. Just saying thanks will not just be sufficient, for the fantastic lucidity in your writing. I will instantly grab your articles to get deeper into the topic. And as the same way ExcelR also helps organisations by providing
    Data science course based on practical knowledge and theoretical concepts. It offers the best value in training services combined with the support of our creative staff to provide meaningful solution that suits your learning needs.

  4. I feel very grateful that I read this. It is very helpful and very informative and I really learned a lot from it.
    data science

  5. Such a very useful article. I have learn some new information.thanks for sharing.
    data scientist course in mumbai

  6. cool stuff you have and you keep overhaul every one of us
    Data Analytics Course in Mumbai

  7. cool stuff you have and you keep overhaul every one of us
    Data Analytics Course in Mumbai

  8. Such a very useful article. Very interesting to read this article. I have learn some new information.thanks for sharing. ExcelR

  9. Really nice and interesting post. I was looking for this kind of information and enjoyed reading this one. Keep posting. Thanks for sharing.
    ExcelR Business Analytics Course

  10. Very nice blog here and thanks for post it.. Keep blogging...
    ExcelR data science training

  11. Attend The PMP Certification From ExcelR. Practical PMP Certification Sessions With Assured Placement Support From Experienced Faculty. ExcelR Offers The PMP Certification.
    ExcelR PMP Certification

  12. I really enjoy simply reading all of your weblogs. Simply wanted to inform you that you have people like me who appreciate your work. Definitely a great post. Hats off to you! The information that you have provided is very helpful.
    ExcelR data analytics courses

  13. I just got to this amazing site not long ago. I was actually captured with the piece of resources you have got here. Big thumbs up for making such wonderful blog page!
    Data science

  14. This is also a very good post which I really enjoyed reading. It is not every day that I have the possibility to see something like this.. best data science course in bangalore


  15. Very Good Information...

    Data science Course in Pune

    Thank You Very Much For Sharing These Nice Tips..

  16. Interesting post. I Have Been wondering about this issue, so thanks for posting. Pretty cool post.It 's really very nice and Useful post.I am interested in some of them.I hope you will give more information on this topics in your next articles.
    Data Science training
    data analytics course
    business analytic course

  17. Excellent erudition Providing by your Article, thank you for taking the time to share with us such a nice article. Amazing insight you have on this, it's nice to find a website that details so much information about different artists. Kindly visit the LiveWebTutors website we providing the best online assignment help services in Australia.

    For More Info: Assignment Help Perth


Powered by Blogger.