Software
Amazon AWS SageMaker Unifies Data Control in 2024
2024-12-03
Amazon Web Services (AWS) has been at the forefront of cloud computing for nearly a decade. Their platform, SageMaker, has been a key player in creating, training, and deploying AI models. This year, however, the focus shifted towards streamlining. At the re:Invent 2024 conference, AWS unveiled SageMaker Unified Studio, a revolutionary tool that brings together data from across the organization in one place.
Unveiling SageMaker Unified Studio
SageMaker Unified Studio is a single destination for data discovery and work. It combines tools from various AWS services, including the existing SageMaker Studio, to assist customers in preparing and processing data for model building. As Swami Sivasubramanian, VP of data and AI at AWS, stated, "We are witnessing a convergence of analytics and AI, and SageMaker Unified Studio provides the necessary tools in one place."Customers can publish and share data, models, and other artifacts within their teams or the entire organization using SageMaker Unified Studio. The service offers data security controls and adjustable permissions, along with integrations with AWS' Bedrock model development platform.AI is seamlessly integrated into SageMaker Unified Studio through Q Developer, Amazon's coding chatbot. It can answer questions such as "What data should I use to gain a better understanding of product sales?" or "Generate SQL to calculate total revenue by product category." AWS explained in a blog post that Q Developer supports development tasks like data discovery, coding, SQL generation, and data integration within SageMaker Unified Studio.Expanding the SageMaker Product Family
In addition to SageMaker Unified Studio, AWS launched two small yet significant additions to its SageMaker product family. SageMaker Catalog allows admins to define and implement access policies for AI apps, models, tools, and data using a single permission model with granular controls. SageMaker Lakehouse, on the other hand, provides connections from SageMaker and other tools to data stored in AWS data lakes, data warehouses, and enterprise apps.AWS emphasizes that SageMaker Lakehouse works with any tools compatible with Apache Iceberg standards, an open-source format for large analytic tables. Admins have the option to apply access controls across data in all the analytics and AI tools that SageMaker Lakehouse interacts with.Improved Integration with SaaS Applications
In a related development, SageMaker now integrates better with software-as-a-service applications. Thanks to these new integrations, SageMaker customers can access data from apps like Zendesk and SAP without the need for extraction, transformation, and loading. AWS wrote, "Customers often have data spread across multiple data lakes and a data warehouse. A simple way to unify this data would benefit them. Now, they can use their preferred analytics and machine learning tools on their data, regardless of where it is physically stored, to support various use cases including SQL analytics, ad-hoc querying, data science, machine learning, and generative AI."