Apache Kudu is a free and open source column-oriented data store of the Apache Hadoop ecosystem. It is compatible with most of the data processing frameworks in the Hadoop environment. It provides completeness to Hadoop's storage layer to enable fast analytics on fast data.[3]

Apache Kudu
Other namesKudu
Developer(s)Apache Kudu Committers and PMC Members
Stable release
1.16.0 / 11 April 2022; 2 years ago (2022-04-11)[1]
RepositoryKudu Repository
Written inC
Operating systemLinux, macOS
TypeDatabase management system, Distributed data store
LicenseApache License 2.0[2]
Websitekudu.apache.org Edit this on Wikidata

The open source project to build Apache Kudu began as internal project at Cloudera.[4] The first version Apache Kudu 1.0 was released 19 September 2016.[5]

Comparison with other storage engines

edit

Kudu was designed and optimized for OLAP workloads. Like HBase, it is a real-time store that supports key-indexed record lookup and mutation.[6] Kudu differs from HBase since Kudu's datamodel is a more traditional relational model, while HBase is schemaless. Kudu's "on-disk representation is truly columnar and follows an entirely different storage design than HBase/Bigtable".[6]

See also

edit

References

edit
  1. ^ "Apache Kudu - Releases". Retrieved 27 September 2022. Kudu 1.16.0 was released on Apr 11, 2022.
  2. ^ "Project Status". 2017-05-21. Archived from the original on 2017-05-21. Retrieved 2017-05-21. Is Kudu open source? Yes, Kudu is open source and licensed under the Apache Software License, version 2.0. Apache Kudu is a top level project (TLP) under the umbrella of the Apache Software Foundation.
  3. ^ "Home". kudu.apache.org.
  4. ^ "Why was Kudu developed internally at Cloudera before its release?". 2017-05-21. Retrieved 2017-05-21.
  5. ^ "Apache Kudu releases". 2017-05-21. Archived from the original on 2017-05-21. Retrieved 2017-05-21. Kudu 1.0.0 was released on September 19, 2016. It is the first release not considered "beta". [...] Kudu 0.5.0 (beta) was released on Sep 28, 2015. It was the first public version of Kudu.
  6. ^ a b "Why build a new storage engine? Why not just improve Apache HBase to increase its scan speed?". 2017-05-21. Archived from the original on 2017-05-21. Retrieved 2017-05-21.
edit