Six month ago we adopted Java 23 as requirement, following our standard procedure to upgrade with each Java version as soon as it becomes available. This allows us to take advantage of all the great improvement each release brings. The upgrade to 23 was pretty easy since the changes from 22 to 23 were not that big. The story turns out to be a bit different now with our upgrade to Java 24.| Trino Blog
What a long journey it has been! From the start Trino supported querying Hive data and used libraries from the Hive and Hadoop ecosystem. With the release of Trino 470 we mark another milestone to more features and better performance for data lake and lakehouse querying with Trino. We deprecated the legacy file system support, and will permanently remove them in an upcoming release.| Trino Blog
Wow, what an amazing year 2024 was for Trino! Martin Traverso presented about the achievements and progress of the project at the recent Trino Summit 2024. Let me dive deeper into the content of his keynote and elaborate some more about our amazing plans for the future.| Trino Blog
What a view we had at the summit! Over 700 live attendees enjoyed the sessions and learned more about Trino-related use cases and projects. Now it is time for the additional 1000 registrants, our 13000+ Trino users on Slack, and everyone else in the Trino community and beyond to enjoy the presentations and recordings at their leisure.| Trino Blog
Apache Ranger has arrived! With the new Trino 466 you all get another jam-packed release of Trino awesomeness. One of the goodies is a new plugin for access control for your data with Apache Ranger, and it has gone through a long story to get here. Apache Ranger has a long history and wide adoption as an access control system for data lakes using Hadoop and Hive. Since Trino brings fast analytics to this space, and also supports modern data lakehouses and other data sources, Apache Ranger is ...| Trino Blog
We just wrapped up our mini training series SQL basecamps before Trino Summit, and now Trino Summit 2024 is less than three busy weeks away. It’s a good thing that we have also been working hard on all the preparations for the summit. Everything is coming together, and we are excited to share the full lineup for the free, virtual, two day event today.| Trino Blog
Trino Summit is inching closer fast, and we are busy with all the preparation. Nevertheless, we thought we bring you some more SQL and Trino-related training. The two live classes from our SQL basecamps before Trino Summit are now available for you all to enjoy, just in case you missed it.| Trino Blog
Trino is written in Java. Trino contributors and maintainers are often veterans in the Java ecosystem and community, and Trino is very modern when it comes to Java. For example, Trino now requires the latest Java version and actively uses new features. When it comes to JavaScript however, the story is a bit more complicated. Of course, JavaScript is commonly used in the Trino ecosystem and codebase. Let’s look at some of the specifics.| Trino Blog
Our efforts around Trino Summit 2024 are ramping up and the event is creeping closer and closer. We are really looking forward to the two-day, free, virtual event in December about all things Trino. While we are working hard to put together the SQL basecamps before Trino Summit training sessions and other community events, a number of your awesome peers from the Trino community submitted session proposals, and we are excited to share that glimpse on the agenda for Trino Summit 2024.| Trino Blog
Trino is deployed everywhere – on-premise, in private data centers, in the cloud with hosting providers, on bare metal servers, on virtual machines, and with containers. With all these options for deployments, a Kubernetes-based platform with a container emerged as the most widely used approach. The Trino project caters for this usage with our container images for every release and our Helm chart. However we keep hearing from people who want to use a Kubernetes operator…| Trino Blog
Later in December your knowledge of our Trino SQL query engine will certainly peak again at Trino Summit 2024. To reach those heights and absorb all there is to learn at Trino Summit, you need to get ready. That is why I teamed up with our Trino creators and BDFLs – Martin Traverso, Dain Sundstrom, and David Phillips. We aim to be your coaches and trainers to get you ready and get to the summit without the need for oxygen masks and sherpas. Join us for the “SQL basecamps before Trino Summ...| Trino Blog
Only about ten Trino releases or six months ago, we released Trino 447 with the requirement to use Java 22. In recent releases we started to take more and more advantage of features that are only available with that upgrade. We made some big steps in terms of performance and talked talked about some of those performance enhancements around aircompressor in the recent Trino Community Broadcast 65. The Java community runs its release processes on a very predictable schedule - March and Septembe...| Trino Blog
Fresh off the heels of Trino Fest 2024, where Commander Bun Bun was busy meeting the Trino community in-person, we’re already looking forward to another, bigger event to round out the year in Trino. For those who’ve been here a while, you know that can only mean one thing: Trino Summit 2024. Much like last year, it will be a two-day, fully virtual event, hosting a wide range of talks covering all things Trino on the 11th and 12th of December. Read on for more info, or if you’re already ...| Trino Blog
Trino Fest 2024 is successfully in the books! While over 100 enthusiastic members of the community gathered in Boston, over 650 virtual attendees joined us worldwide to learn from our expert speakers as they discussed topics such as table formats, enhancements and optimizations, and use cases with Trino both large and small. And now it is your chance to revisit the presentations or catch up on everything you missed.| Trino Blog
This week has surely started off with a big bang and another boom in the data platform world. Snowflake introduced the open source Polaris catalog as implementation of the Iceberg REST catalog specification. And Databricks, the main driver of the Delta Lake table format, announced their acquisition of Tabular, a main driver in the Apache Iceberg community. Interestingly enough, Trino is in the middle of all this with great support for Delta Lake, Hudi, Iceberg, and also the Iceberg REST catal...| Trino Blog
We gave a sneak peek of the Trino Fest lineup a month ago, and we’re excited to now bring you the full lineup for the event. We’ve got some major names being added, including Amazon, Microsoft, and another talk from Apple. With Fourkites and a joint talk with LanceDB and CharacterAI also added to the schedule, we’re excited to present the full lineup for Trino Fest 2024. Trino Fest is barely a month away on the 13th of June, and whether you want to attend live in Boston or tune in virtu...| Trino Blog
Trino Fest is drawing ever closer. Commander Bun Bun has been hard at work behind the scenes arranging the schedule and making sure that Trino’s trip to Boston is going to be a great one. In case you missed it, we announced Trino Fest a couple months ago, and if you have missed it, make sure to go register to attend! All our speakers will be in person in downtown Boston on the 13th of June, with plenty of opportunities for networking and a happy hour event at the end of the day. But if you ...| Trino Blog
Exciting news - time travel capability has finally arrived in the Delta Lake connector! After introducing support for time travel in the Iceberg connector back in 2022, we’re thrilled to announce that the Delta Lake connector now joins the ranks as the second connector offering this feature.| Trino Blog
It was not that long ago that we first announced support for Java 21, and subsequently made it a build and runtime requirement with Trino 436. Since then, the codebase received some significant improvements in readability, and we have also seen better performance. However, innovation in Trino and Java is not holding still, on the contrary - it’s accelerating. On the Java community side, Java 22 is just about to be released, and we think it is time to drive innovation in Trino even further. ...| Trino Blog
Thinking about our recent work on caching in Trino reminds me of the famous saying, “There are only two hard things in computer science: cache invalidation and naming things.” Well, in the Trino community we know all about caching and naming. With the recent Trino 439 release, caching from object storage file systems got a refresh. Catalogs using the Delta Lake, Hive, Iceberg, and soon Hudi connectors now get to access performance benefits from the new Alluxio-powered file system caching.| Trino Blog
Do you know where the name ‘Trino’ comes from? It’s actually a shortened form of ‘neutrino’. These fast and lightweight subatomic particles have recently made their way to Japan. You can now reserve your copy of the Japanese edition of Trino: The Definitive Guide!| Trino Blog
After the resounding success of Trino Fest and Trino Summit in 2023, Commander Bun Bun has exciting news to share: we’re taking our biggest events of the year back to being in-person. They’ll be hybrid, to be more specific, so if you can’t travel, don’t fret, you’ll still be able to watch and ask questions in chat. But if you can travel, you won’t want to miss out! Everything you already know and love about Trino Fest is moving to the East Coast for the lovely Boston summer. The e...| Trino Blog
Trino now ships with an access control integration using the popular and widely used Open Policy Agent (OPA) from the Cloud Native Computing Foundation. The release of Trino 438 marks an important milestone of the effort towards this integration.| Trino Blog
If “Wrapped” is good enough for Spotify, it’s good enough for Trino, right? As we look forward to a bright 2024, we can also take a moment to get sentimental, look back at everything we’ve accomplished, and reflect on the progress we’ve made. Commander Bun Bun has been hard at work, so if you haven’t been paying close attention to Trino or want an idea of all that went down in 2023, we’re happy to present you with an end of year recap. We’ll be exploring what’s gone on in th...| Trino Blog
Two days of non-stop Trino action are done! Last week, Trino Summit 2023 took place virtually another great community event. Great presentations from Trino experts across the globe showed different use cases and experiences with Trino.| Trino Blog
Are you ready? Trino Summit 2023 is just two days away, and our lineup of speakers, sponsors, and activities is truly amazing. Make sure to register and join us live.| Trino Blog
In the fourth part of our training series Learning SQL with Trino from the experts Martin Traverso, Dain Sundstrom and I took on the big topic of aggregation functions, and covered the two new and exciting features of table functions and SQL routines.| Trino Blog
As winter nears, the days may be getting shorter, but so is the wait until Trino Summit 2023! It’ll be here before you know it on December 13th and 14th. We’ve got a packed speaker lineup full of exciting talks, and we’re ready to share some details with the Trino community today. Read on for a preview of some talks, and if you’re interested in attending, make sure to… Register!| Trino Blog
In the third part of our training series Learning SQL with Trino from the experts David Phillips and I changed gears from reading data and performing analytics with Trino. We looked the the topic of write operations. We covered creating catalogs, schema, tables, and then inserting and updating data, and talked about related topics such as data source and connector support.| Trino Blog
We want to see the best SQL routines you can write, feature them as examples in the documentation, and send you some goodies as a reward!| Trino Blog
We’re excited to announce that as of version 432, Trino can run with Java 21. In fact, the Trino Docker image uses Java 21 now. We have done upgrades to newer Java LTS versions successfully before when we upgraded to Java 11 and then Java 17 with Trino 390. Each time the improvements to the JVM runtime, the garbage collectors, the involved libraries, and the dependencies resulted in performance gains that came nearly for free. And each time we were able to take advantage of new language con...| Trino Blog
In the second part of our training series Learning SQL with Trino from the experts Martin Traverso and I built on top of the foundational knowledge from the first training session. We continued to learn more about data types and working with them, including the important strings, numeric, temporal, and JSON types.| Trino Blog
In our training series Learning SQL with Trino from the experts Martin Traverso, Dain Sundstrom, David Phillips, and myself will run through the wide range of SQL support and features of Trino with our audience. In the first episode, we covered the concepts of Trino and SQL, and then started to learn some basic SQL. Now you can take advantage of the recording and available resources to learn at your own pace.| Trino Blog
The Trino community in Japan held an online event on October 5th, 2023. This article is a summary of the conference aiming to share the presentations and provide an overview.| Trino Blog
You started with one Trino cluster, and your users like the power for SQL and querying all sorts of data sources. Then you needed to upgrade and got a cluster for testing going. That was a while ago, and now you run a separate cluster configured for ETL workloads with fault-tolerant execution, and some others with different configurations. With Trino Gateway we now have an answer to your users request to provide one URL for all the clusters. Trino Gateway has arrived!| Trino Blog
Do you have a rough idea of what SQL is? Do you need to get data out of object storage in the cloud and some relational database at the same time? You should look at Trino and learn about SQL. Or do you know the ins and outs of joins, window functions, and your SQL queries are counted by the pages and not lines? You may even be the expert on SQL on your team. You should also look at Trino and SQL. Luckily for you all, we have the right SQL training for everyone in our upcoming series with the...| Trino Blog
Trino, Trino, Trino everywhere. Just looking at our website stats and the users in our community chat, we know that Trino is going places. We also know that one of these places with a large user community is China. And now we have good news for you. A translation of the second edition of the book to Chinese is now available.| Trino Blog
The Trino community is buzzing. Commander Bun Bun is ready to invite you all to join us for Trino Summit 2023. And “all” really means everyone in the community. The event is free to attend, virtual, and full of news and shared knowledge from your peers using Trino. Don’t hesitate to submit your talk and register to attend now.| Trino Blog
Fugue may be an unfamiliar name to those in the Trino ecosystem. It’s another Python tool, a programming model built to enhance interoperability between Python and SQL. On the Python side of things, it’s a wrapper around common tools like pandas and Polars that convert code into SQL for high-performance, large-scale query execution. So why are we talking about it at Trino Fest? Because Fugue recently launched an integration with Trino, enabling you to write Python code that can be convert...| Trino Blog
Let’s cut straight to the chase with this lightning talk from Benjamin Jeter, a data architect, platform manager, and data engineer at Datto. For those that are not familiar with Datto, they are an American cybersecurity and data backup company. They’re the leading global provider of security and cloud-based software solutions purpose-built for Managed Service Providers (MSPs). In Benjamin’s talk, he goes through some of the considerations and design goals of a reference architecture pa...| Trino Blog
By 2025, there will be 100 zetabytes stored in the cloud. That’s 100,000,000,000,000,000,000,000 bytes - a huge, eye-popping number. But only about 10% of that data is actually used on a regular basis. At Uber, for example, only 1% of their disk space is used for 50% of the data they access on any given day. With so much data but such a small percentage being used, it raises the question: how can we identify frequently-used data and make it more accessible, efficient, and lower-cost to acce...| Trino Blog
For those unfamiliar, Stripe is an online payment processor that facilitates online payments for digital-native merchants. They use Trino to facilitate ad hoc analytics, enable dashboarding, and provide an API for internal services and data apps to utilize Trino. In Kevin Liu’s session at Trino Fest 2023, he showcases the Trino Iceberg connector and how it can replace more complex usage to access Iceberg metadata. He also discusses how Trino is a core part of operations at Stripe.| Trino Blog
At Comcast, data is used in a data mesh ecosystem, with a vision where users can discover data and request data through a self-service platform. With federation, various tools, and the ability to create, read, and write data with different platforms, it’s a full-blown data mesh. So how do you build that? With Trino, of course, and with the power of Hive views. Tune into the 10-minute lightning talk that Alejandro gave at Trino Fest to learn more about how Comcast pulled it off.| Trino Blog
The need to make blockchain data easily accessible has risen over the recent years due to the popularity of cryptocurrencies, NFTs, and other uses of blockchains. Dune has made it their mission to make blockchain data more accessible. Dune is a community data platform for querying public blockchain data and building beautiful dashboards. They use their own query engine called DuneSQL, built as extension of Trino, to query blockchain data. In the session, Miguel and Jonas from Dune talk about ...| Trino Blog
In this recap, we can skip right to the exciting part: through the joint efforts of engineers at ForePaaS and Bloomberg, there is a Snowflake connector coming to Trino! Though it hasn’t landed yet, it has been tested and run in production at both companies, and a pull request is open and working its way towards completion as this blog post goes up. In the talk, Yu and Erik talk about difficulties in developing the connector, the motivations to make it happen, and the new features that come ...| Trino Blog
Ever since the pandemic, it has become clear that the need for a digital first economy is becoming more and more necessary. As Redis’ Field CTO Allen Terleto said during their talk from Trino Fest 2023, “In a digital first economy, data is the lifeblood of the organization, which makes the databases the heart of enterprise architectures”. Redis, a popular open source project, is a distributed in-memory key–value database. It includes a cache, message broker, and optional durability. I...| Trino Blog
Optimizing data access and query performance is crucial to building low-latency applications and running analytics. Even with the modern data lakehouse designed to be as efficient and performant as possible, there are a number of bottlenecks that can slow things down and plenty of challenges to overcome. Nadine and Sagar explored this at Trino Fest, introducing us to multi-modal indexing and the metadata table in Hudi, how they work, and how leveraging them with Trino can unlock queries faste...| Trino Blog
Arctic Wolf Networks, a cybersecurity company that provides security monitoring to cyber threats, is one of the companies that have recently switched to using AWS Athena as a new and efficient service to query their data using Trino. AWS Athena is a serverless, interactive analytics service built on open-source frameworks that runs on Trino, supporting open table and file formats and providing a simplified, flexible way to analyze petabytes of data where it lives. Senior software developer An...| Trino Blog
The PyData stack has been described as “unreasonably effective,” empowering its users to glean insights and analyze moderate amounts of data with a high level of flexibility and excellent visualization. The large-scale, production data stack using a query engine like Trino sits on the other side of the world, capable of handling petabytes and exabytes, but perhaps not integrating as seamlessly with the Python ecosystem as one would hope. SQL has been a means of bridging this gap, but we...| Trino Blog
Have you ever wanted to keep your data in a table and have an efficient way to interact with them? Iceberg, an open standard table format, is exactly what you need. One of the great and unique features of the Iceberg table format is its support for change data capture (CDC). Co-creator of Apache Iceberg, Ryan Blue, presented at Trino Fest 2023 this past week detailing the CDC support and the trade-offs between different patterns that can be used for writing CDC streams into Iceberg tables.| Trino Blog
Let’s say you have some data. Maybe it’s in a spreadsheet, a CSV file, a relational database, or multiple terabytes of data in an S3 bucket. You need to run SQL queries on this data, and you’d like to share those results with your teammates, coworkers, and partner teams, but you want to do it in a way that allows everyone to view those results on-demand, on the web, and with the latest results without the need for any manual effort on your part.| Trino Blog
Rolling into our next presentation from Trino Fest 2023, we’re excited to bring you Tuli Navas and Geeta Shankar’s talk from the Performance Engineering Team at Salesforce. They provide numerous reasons for why they need Trino and further explain how it is essential for anomaly detection in their data. It’s an insightful talk about using a query engine to ensure data quality and how switching to Trino has massively improved their performance. You definitely don’t want to miss it.| Trino Blog
Trino Fest 2023 got off to a bang, as Trino co-creator and maintainer Martin Traverso gave an update on all the amazing things that have happened to Trino since Trino Summit last year. He also provided some insight into what’s coming down the pipeline for Trino, with a brief look at the project’s roadmap. You can watch the recording of the talk if you want to see for yourself, or you can read on for the highlights.| Trino Blog
Last week we held Trino Fest, and it kept us all so busy, we forgot to spend time chilling by the lakehouse! Great demos, amazing announcements, new plugins, and use cases reached our active audience. Thanks go to our event host and organizer Starburst, to our sponsors AWS and Alluxio, to our many well-prepared speakers, and to our great live audience. Now you get a chance to catch up on anything you missed.| Trino Blog
Trino Fest is just around the corner! We’re only two weeks away, and we’re excited to share that we’ve got an incredible speaker lineup with a wide variety of talks about all things Trino. If you’re out of the loop, we announced Trino Fest back in April as a two-day, free, virtual event. If you want to attend, see talks live, engage with our speakers in Q&As at the end of each session, you’ll need to register, so don’t delay, and… Register to attend! With that said, we’re also...| Trino Blog
Last week, I had the pleasure to attend Open Source Summit North America 2023 in Vancouver. A quick hop across the Strait of Georgia got me right into the event and into the midst of my peers of open source developers, advocates, and enthusiasts.| Trino Blog
Summer is just around the corner, and we are busy getting ready for Trino Fest 2023. Everything is ramping up. Early birds are starting to register, and so should you. Our Trino Fest theme song is available for your listening pleasure, and we are reviewing speaker submissions. The festival is promising to be another great event to learn about lakehouse use cases with Trino, but we are also featuring some great presentations for querying data with Trino. And of course, we are still looking for...| Trino Blog
In the data lake world, data partitioning is a technique that is critical to the performance of read operations. In order to avoid scanning large amounts of data accidentally, and also to limit the number of partitions that are being processed by a query, a query engine must push down constant expressions when filtering partitions.| Trino Blog
For those who are paying close attention, you may notice updates to a few pages across the Trino website with a renewed focus on leadership roles in Trino. This is part of an effort to re-focus and make the operating model more transparent both for contributors and for end users. While this is not a functional change, this does involve clarifying our roles following the BDFL (benevolent dictator for life) model.| Trino Blog
At this stage Trino is used all around the globe as we know from the community chat and our speakers at Trino Summit 2022. One large community of Trino contributors and maintainers, many employed by Starburst, is located in Poland. Poland also has a very active participation of developers and users in the Java and Big Data communities.| Trino Blog
Get ready to kick off your summer with Commander Bun Bun at Trino Fest 2023! This year’s event is going virtual and will take place over two days, the 14th and 15th of June. The focus of the event will be on Trino as a data lakehouse query engine, with discussions on how new features and the ecosystem around Trino can support better data lakehouse management. Trino Fest 2023 is the new annual summer event dedicated to all things Trino. Building on the success of last year’s Cinco de Trino...| Trino Blog
It’s that time of the year where everyone gives excessively broad or niche predictions about the finance market, venture capital, or even the data industry. And we are now bombarded with “year-in-review” summaries where we find out just how much data is being collected to generate those summaries. End-of-year reflections are always useful because you can find patterns of what’s going well and what’s going poorly. It’s also good to pause and take stock of the things that did go wel...| Trino Blog
At some point in the lifecycle of a successful open source project, it reaches a point where the number of incoming pull requests (PRs) outpace the project’s ability to get code merged. It happens for a huge variety of reasons, including developers moving on to other projects before tying up every loose end, reviewers who miss a request for review, and because some stagnant PRs were never going to happen and should have been closed two years ago. The GitHub notification system doesn’t do ...| Trino Blog
As the holiday season approaches, we have reached the end of our Trino Summit 2022 recap posts. With the last talk of the summit, Mei Long from Upsolver gave an insightful overview of how they use data to inform product decisions.| Trino Blog
As we close in on the final talks from Trino Summit 2022, this next talk dives into how to set up Trino for batch processing. Trino has historically been well-known for facilitating fast adhoc analytics queries as opposed to long-running, resource intensive batch/ETL queries. This is due to the fact that Trino kills queries that run out of resources in order to prioritize faster query execution. Earlier this year, Trino added features to better support batch queries with a new fault-tolerant ...| Trino Blog
This post comes from the second half of Trino Summit 2022 session. Our friends JaeChang and Jennifer from SK Telecom traveled across the globe from South Korea to join us in person! SK Telecom recently had some issues scaling Trino on the Hive model, among other issues that come with Hive. While some initial tweaking helped speed things up, it ultimately never solved the problem. After switching to Iceberg, SK Telecom ran initial performance tests with some very impressive results. In this ta...| Trino Blog
As we near the end of the Trino Summit 2022 recap series, it’s time to take a stop at Quora. At Quora, being an engineer responsible for maintaining Trino comes with its fair share of challenges. With concerns about cost, performance, and reliability, Quora has taken several creative steps to ensure that they get the most out of Trino. Other Trino users may be able to learn a few neat tips and tricks to do the same by tuning in.| Trino Blog
As the Trino Summit 2022 recap post series continues on, I have been reading all the wonderful posts by our awesome speakers, facilitated by the Trino developer relations team. Because I have a perpetual fear of missing out, I convinced them that I should get in on the fun. For this latest installment in the series, I will be recapping my very own Trino Summit talk. Basically, I’m ripping off Bo Burnham’s comedy bit where he reacts to his own reaction video, blog style. In this session, I...| Trino Blog
Buckle up, for the next post in the Trino Summit 2022 recap series. In this post, we’re covering the talk given by Lyft engineers, Charles and Ritesh, on how they have not only scaled Trino as adoption grew, but with less nodes and more effective usage. They also started moving to utilizing Trino more for ETL rather than just interactive analytics. Get ready for a smooth ride as Lyft brings you large scale ETL with Trino.| Trino Blog
Rolling right along with another one of our Trino Summit 2022 recap posts, we’re excited to bring you the engaging talk from Marc Laforet at Shopify. He talked about the ordeal (or, if you look at it in a positive light, the privilege) of migrating petabytes of data from Hive to Iceberg table formats with the help of Trino. With details on why Shopify chose to move to Iceberg, the various migration strategies that were considered, and the ultimate process of moving all that data while the T...| Trino Blog
Tune in for the next post in the Trino Summit 2022 recap series. In this post, we’re joining Saj from Comcast, to talk about their migration from a data fabric to data mesh. Saj shows you that there is more to the buzzword than meets the eye. He gives a solid overview of why Comcast is taking data mesh to heart.| Trino Blog
Sometimes when working in the depth of the query engine core, the planner, the optimizer, or some other tricky problem or code, you run out of steam and need a distraction. Your brain will quietly work on the problem in the background while you get to read or do something interesting. During one break we found out more about GPT-3 and the recent improvements on the OpenAI project. So we had to try.| Trino Blog
Continuing with the Trino Summit 2022 sessions posts, we’re diving into an insightful lightning talk from Goldman Sachs. They explore how they use Trino to help ensure data quality across the board for all users and customers. By using Trino to federate their various data sources, querying everything in one place provides them with the flexibility they need. With that flexibility, they can validate that all data is as it should be where that data lives, settling any concerns that may exist ...| Trino Blog
In this installment of the Trino Summit 2022 sessions posts, we jump into an exciting topic by folks from Zillow about running Trino on spot instances. Spot instances are cheap and ephemeral nodes that lead to reduced overall compute costs. Spot instances are cheaper as they are not guaranteed to remain available. In this session, Zillow engineers talk about how they use Trino on spots to take advantage of the cost savings while handling the transitory nature of spots.| Trino Blog
Our community just keeps growing! Today, it is time to reach out and welcome another large group of Trino users. The release of the new engine version for Amazon Athena upgrades Athena to a recent version of Trino from a rather old version. This update brings a ton of improvements from the Trino project to the users of the popular cloud-based query service.| Trino Blog
This post continues a larger series of posts on the Trino Summit 2022 sessions. Following the Trino at Apple talk, engineers from Bloomberg shared the latest about their additions to Trino. Bloomberg uses Trino to federate huge amounts of disparate financial data together. When you have many users with different use cases and resource needs, you need something to ensure that the huge workloads don’t bully the small ones. Enter the Trino Load Balancer, a privacy-aware solution to help mainta...| Trino Blog
This post continues a larger series of posts on the Trino Summit 2022 sessions. Following the Keynote: State of Trino session, engineers from Apple shared the current usage of Trino at Apple. They discuss how they support Trino as a service for multiple end-users, and the critical features that drew Apple to Trino. They wrap up with some challenges they have faced and some development they have planned to contribute to Trino.| Trino Blog
To kick off the Trino Summit 2022, we heard from Trino co-creators Martin Traverso, Dain Sundstrom, and David Phillips. Martin gave a talk on the state of Trino and project plans for 2023, then opened the floor to questions from the community. You can watch a recording of the talk, or read on if you’re only interested in the highlights.| Trino Blog
Trino Summit 2022 was in a word, invigorating. I’m still coming off the high from the amount of energy I gained from being at this summit, meeting many of you face-to-face for the first time. Most surprisingly, I learned that Trino contributor James Petty from AWS was actually not famous painter Bob Ross.| Trino Blog
This blog post wraps up a series of previous posts teasing Trino Summit 2022. The conference is free and takes place in San Francisco, California on November 10th. Join us either in-person or virtually! Register now| Trino Blog
Trino has long been the de facto standard to querying large data sets over your cloud or on-prem storage, also known as data lakes. This Trino Summit’s theme instead will showcase Trino’s other claim to fame: query federation. Trino is a query engine providing an access point that exposes ANSI SQL across multiple data sources. I urge you to join us either in-person or virtually if you are a fan of Trino, big data, open source, data engineering, Java, or all the above! This conference is f...| Trino Blog
It was time for a refresh. A little while ago in April 2021, we announced the Trino version of our definitive guide. But again, Trino as a project and community has continued to innovate and grow. Numerous smaller and larger details changed, and the examples and resources needed to be fixed. Today, we are happy to announce that after a few months of updates, testing, and editing, the second edition of Trino: The Definitive Guide is available. Get a free copy from Starburst now!| Trino Blog
Commander Bun Bun is back and this year we have an exciting lineup of speakers. Topics range from architectures like data mesh and data lakehouse, to running Trino at scale with fault-tolerant execution, and query federation. This conference is free and takes place on November 10th. The summit is a hybrid event for in-person and virtual attendance. Find out more details below!| Trino Blog
Wow, have we ever come a long way with Python support for Trino. It feels like ages ago that we talked about DB-API, trino-python-client, SQLAlchemy, Apache Superset, and more in Trino Community Broadcast episode 12. More recently we talked about dbt in episode 21 and episode 30, but there is so much more for Pythonistas, Pythonians, Python programmers, and simply users of Python-powered tools.| Trino Blog
What an exciting month we had in August! August marked the ten-year birthday of the Trino project. Don’t worry if you missed all the excitment as we’ve condensed it all in this post.| Trino Blog
An important aspect of a good data pipeline is ensuring data quality. You need to verify that the data is what you’re expecting it to be at any given state. Great Expectations is an open source tool created in Python that allows you to write detailed tests called expectations against your data. Users write these expectations to run validations against the data as it enters your system. These expectations are expressed as methods in Python, and stored in JSON and YAML files. One great advant...| Trino Blog
It’s inspiring and mindblowing to reflect on the ten year journey that has produced the community around Trino. Trino is the community-driven fork from Presto, the distributed big data SQL query engine created at Facebook in 2012. We are a community of engineers, scientists, analysts, and visionaries that work in a fast paced world where the expectations on the time to insights from our analytics and the scale of the data are ever-increasing. Sometimes words only do so much justice to encom...| Trino Blog
It’s amazing how far we have come! Our massively-parallel processing SQL query engine, Trino, has really grown up. We have moved beyond just querying object stores using Hive, beyond just one company using the project, beyond usage in Silicon Valley, beyond simple SQL SELECT statements, and definitely also beyond our expectations. Let’s have a look at some of the great technical and architectural changes the project underwent, and how we all benefit from the commitment to quality, opennes...| Trino Blog
It might surprise some that our departure from Facebook was one of the simplest decisions we’ve ever made. Many posts that discuss leaving a FAANG company focus on leaving some grand sum of money or prestige of working at the company. For us, we were leaving the company where we had launched a project that we knew would quickly outgrow the walls of Facebook, and solve a much larger set of problems in the analytics domain. At the time we didn’t quite anticipate that Presto, a distributed S...| Trino Blog
In the Trino community, we know that being the coolest query engine is a tough job. We boldly face the intricacies of the SQL standard to bring you the newest and most powerful features. Today, we proudly announce that as of release 381, Trino is on its way to full support for polymorphic table functions (PTFs). In this blog post, we are explaining the concept of table functions and exploring how they can be leveraged. We also look at what we have already implemented, and take a sneak peek in...| Trino Blog
You’ve already read the title, and it’s exciting news - as of Trino version 390, which releases today, Trino has officially been updated from Java 11 to Java 17. This has a few implications, the most important of which is that if you aren’t running the Docker image (which automatically comes with the correct version of Java) and you’ve been running Trino on Java 16 or older, you’ll need to update Java to run Trino versions 390 and later. It’s also worth mentioning that newer versi...| Trino Blog
The recent addition of the fault-tolerant execution architecture, delivered to Trino by Project Tardigrade, makes the use of Trino for running your ETL workloads an even more compelling alternative than ever before. We’ve set up a demo environment for you to easily give it a try in Starburst Galaxy.| Trino Blog
We are pleased to announce the upcoming 2022 Trino Summit. The summit is scheduled as hybrid event on the 10th of November 2022, and attendance is free! You will be able to join us online, or you can make the trip to San Francisco and meet us at the Commonwealth Club on the downtown waterfront. Please be aware that spots at the live event are limited, so register soon if you want to attend. Please also be aware that you need to register regardless of whether you’ll be joining us in-person o...| Trino Blog
This past week, Andrii Rosa hosted a virtual Trino meetup on the topic of using Trino as a batch processing engine. You can view the talk from the meetup embedded below. Andrii dives into the history of Trino as an engine for Batch ETL (extract, transform, load) processing, some challenges related to that, as well as the new fault-toleration execution capabilities being added to Trino and how they improve it for Batch ETL use cases.| Trino Blog
At QazAI, we build data lakes as a service for companies. In the original architecture, we get raw data in S3, transform the S3 data with Hive, and then delivered the data to business units via our datamart built on Clickhouse (for optimal delivery speeds). Over time, we were dragged down by the slower speeds and high costs of running Hive, and started shopping for a faster and cheaper open source engine to do our ETL data transformations.| Trino Blog
Maximizing your experience with zero choices. I’m publishing this blog post in partnership with the Trino community to go along a lightning talk I’m giving for their event, Cinco de Trino. This article was originally published on Abhi’s Medium site “My data is all over the place and attempting to analyze or query it is not only time consuming and expensive, but also emotionally taxing.”| Trino Blog
When Trino (formerly PrestoSQL) arrived on the scene almost 10 years ago, it immediately became known as the much faster alternative to the data warehouse of big data, Apache Hive. The use cases that you, as the community, have built had far exceeded anything we had imagined in complexity. Together we’ve made Trino not only the fastest way to interactively query large data sets, but also a convenient way to run federated queries across data sources to make moving all the data optional. At C...| Trino Blog
After six months of challenging work on Project Tardigrade, we are ready to launch. With the project we improved the user experience of running resource intensive queries that are common in the Extract, Transform, Load (ETL) and batch processing space. It required some significant and fascinating engineering to get us to the current status. The latest Trino release includes all the work from Project Tardigrade. Read on to learn how it all works, and how to enable the fault-tolerant execution ...| Trino Blog
Over the last couple of months we’ve added support for full query retries, landed experimental support for task level retries and provided a proof of concept implementation of a distributed exchange plugin (description below). We are still working on improving scheduling algorithms as well as optimizing exchange plugin implementation to make the task level retries fully usable.| Trino Blog
The software and the community you have come to love and depend on aren’t going anywhere, we are simply renaming.| trino.io