Updated April 11, 2024

Trino is a query engine tailored for efficient, low-latency analytics. It seamlessly queries exabyte-scale data lakes and vast data warehouses. This ANSI SQL-compliant engine collaborates with BI tools like R, Tableau, Power BI, Superset, and more. Trino excels in use cases ranging from ad-hoc interactive analytics to massive batch queries and high-volume sub-second applications. Its native query capability spans Hadoop, S3, Cassandra, MySQL, and more, eliminating the need for complex data copy processes. With Trino, you can access data from multiple systems within a single query, enabling insightful correlations.

You can deploy Trino in your Nebius AI Managed Service for Kubernetes clusters using this Marketplace product.

Warning

If you are going to use this product in production, we recommend to configure it according to the Trino recommendations.

Deployment instructions

Before installing this product:

  1. Create a Kubernetes cluster and a node group in it. Each node in the group should have at least 8 GB of RAM.
  2. install kubectl, and configure it to work with the created cluster.

To install the product:

  1. Describe Trino connectors for your data sources:

    1. Create a configuration file descibing the connectors. All possible connectors are listed in Trino documentation. Configuration example:

      connector_1: |-
        connector.name=postgresql
        connection-url=jdbc:postgresql://example.net:5432/database
        connection-user=root
        connection-password=secret
      connector_2: |-
        connector.name=redis
        redis.table-names=schema1.table1,schema1.table2
        redis.nodes=host:port
      connector_3: |-
        connector.name=elasticsearch
        elasticsearch.host=localhost
        elasticsearch.port=9200
        elasticsearch.default-schema-name=default
      
    2. Encode the file contents into the Base64 format:

      cat <file_path> | base64
      
    3. Copy the previous command output.

  2. On the cluster page in the management console, go to the Marketplace tab, select the product, and click Install.

  3. Configure the application:

    • Namespace: Select a namespace or create a new one.
    • Application name: Enter an application name.
    • Additional Catalogs: Paste the encoding command output.
  4. Click Install.

  5. Wait for the application to change its status to Deployed.

  6. To check that Trino is working:

    1. Store the coordinator pod name in a variable POD_NAME:

      POD_NAME=$(kubectl get pods -l "app=trino,component=coordinator" -o name)
      
    2. Check the access to Trino’s UI:

      kubectl port-forward $POD_NAME 8080:8080
      
    3. Go to http://localhost:8080 in your web browser to access the UI.

Billing type
Free
Type
Kubernetes® Application
Category
Dataset preparation
Publisher
Nebius
Use cases
  • Interactive data analytics: query large data sets swiftly, test hypotheses, perform A/B testing, and create visualizations or dashboards in real time.

  • High-performance analytics of object storage: SQL-based analytics of HDFS/Hive object storage systems provides unparalleled performance that enables previously impossible or time-consuming analytics, especially when migrating from Hive-based systems to cloud object storage.

  • Centralized data access and analytics with query federation: use one SQL to query multiple data sources such as object storage, relational databases, streaming, or NoSQL systems in a unified way.

  • Batch ETL processing across disparate systems: expedit large extract, transform, load (ETL) processes with standard SQL statements, enabling resource-efficient batch processing that works seamlessly across diverse data sources and targets, making it an invaluable tool for engineers.

Technical support

Nebius AI does not provide technical support for the product. If you have any issues, please refer to the developer’s information resources.

Product composition
Helm chartVersion
Pull-command
Documentation
cr.nemax.nebius.cloud/yc-marketplace/nebius/trino/chart/trino0.18.0Open
Docker imageVersion
Pull-command
cr.nemax.nebius.cloud/yc-marketplace/nebius/trino/trino1708369351087495763923373469719635755577741008056432
Terms
By using this product you agree to the Nebius AI Marketplace Terms of Service and the terms and conditions of the following software: Apache 2.0
Billing type
Free
Type
Kubernetes® Application
Category
Dataset preparation
Publisher
Nebius