Data Lakes
Catalog
10 items
External table
From v3.0 onwards, we recommend that you use catalogs to query data from Hive, Iceberg, and Hudi. See Hive catalog, Iceberg catalog, and Hudi catalog.
File external table
A file external table is a special type of external table that lets you query Parquet and ORC data files in external storage systems directly, without loading the data into StarRocks. File external tables also do not rely on a metastore. In the current version, StarRocks supports the following external storage systems: HDFS, Amazon S3, and other S3-compatible storage systems.
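As a rough illustration of what this looks like in practice, the sketch below defines a file external table over a Parquet directory on HDFS and then queries it. The table name, column list, and path are hypothetical placeholders, and the exact set of supported properties may differ between versions, so treat it as a starting point rather than a reference. Access to Amazon S3 or S3-compatible storage would need additional credential properties that are not shown here.

```sql
-- Hypothetical example: table name, columns, and path are placeholders.
CREATE EXTERNAL TABLE person_file_ext
(
    name STRING,
    id   INT
)
ENGINE = file
PROPERTIES
(
    -- Location of the data files in external storage (placeholder address).
    "path"   = "hdfs://x.x.x.x:8020/user/hive/warehouse/person_parq/",
    -- File format of the data: parquet or orc.
    "format" = "parquet"
);

-- Query the files in place; no data is loaded into StarRocks.
SELECT name, id FROM person_file_ext LIMIT 10;
```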
Data Cache
This topic describes the working principles of Data Cache and how to enable Data Cache to improve query performance on external data.
Data lake FAQ
This topic answers frequently asked questions (FAQ) about data lakes and provides solutions to the issues described. Some metrics mentioned in this topic can be obtained only from the profiles of SQL queries. To obtain query profiles, you must first run SET enable_profile = true.
Feature Support
From v2.3 onwards, StarRocks supports managing external data sources and analyzing data in data lakes via external catalogs.
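For orientation, here is a minimal sketch of how an external catalog might be created and queried, assuming a Hive cluster that uses a Hive metastore. The catalog name, metastore address, database name, and table name are all hypothetical placeholders, and real deployments usually need further storage and authentication properties.

```sql
-- Hypothetical example: catalog name and metastore URI are placeholders.
CREATE EXTERNAL CATALOG hive_catalog_example
PROPERTIES
(
    "type" = "hive",
    "hive.metastore.uris" = "thrift://xx.xx.xx.xx:9083"
);

-- List the databases that the catalog exposes.
SHOW DATABASES FROM hive_catalog_example;

-- Query an external table by its fully qualified name
-- (db_name and table_name are placeholders).
SELECT * FROM hive_catalog_example.db_name.table_name LIMIT 10;
```

Because the catalog reads metadata from the external metastore, no per-table DDL is needed on the StarRocks side, which is the main practical difference from the external table approach.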