JindoData is a self-developed data lake storage acceleration suite by Alibaba Cloud's Big Data team, designed for big data and AI ecosystems. It provides comprehensive access acceleration solutions for both Alibaba Cloud and major industry data lake storage systems.
Built on a unified architecture and kernel, JindoData supports access through the all-purpose SDK (JindoSDK) to OSS/OSS-HDFS. JindoSDK is compatible with HCFS interfaces, object storage interfaces, and POSIX interfaces, as well as supporting Python and TensorFlow, along with a full ecosystem of compatible tools (JindoCli, JindoFuse, JindoDistCp) and plugins.
JindoSDK serves as the standard client for accessing JindoData components. To install and verify it, refer to JindoSDK Download and JindoSDK Quick Start. For multi-platform support, consult JindoSDK Multi-Platform Support.
As an actively updated client providing continuous updates with new features and performance improvements for Alibaba Cloud EMR data lakes, we recommend users stay up-to-date with the latest JindoSDK version for ongoing support and optimal experience. A convenient script is available to assist in upgrading JindoSDK across your cluster; please refer to the JindoSDK Upgrade Documentation.
See Using JindoSDK in Hadoop Ecosystem
See Using JindoSDK in AI Ecosystem
See Jindo Python SDK Quick Start
See Using JindoTensorFlowConnector
See Overview of Using JindoRuntime with Fluid
See Authentication in JindoData
See Quick Start for JindoDistCp
See Using JindoTable SetStorage Class
See OSS-HDFS Service (JindoFS) Client Tools Overview
Refer to the JindoData FAQs
Check out the JindoSDK Release Notes for version information.