CTERA’s Intelligent Data Platform is designed to unify unstructured enterprise data into a secure fabric that serves as a foundation for AI agents. By turning the file system into an “agentic coordination layer,” CTERA enables organizations to make their data AI-ready without large-scale migration or data movement. Aron Brand, CTO of CTERA, explained that the platform’s content-aware file system goes beyond bytes and blocks to understand the actual content of files. This lets the platform prepare data for AI agents by removing low-quality data and tagging risky information, which streamlines agent operations and avoids context rebuilding. CTERA aims to provide an “operating system for agents,” allowing them to reliably understand, trust, and act upon existing enterprise data at petabyte scale.
The CTERA Intelligent Data Platform is built on three core headless services: CTERA Search, CTERA Classify, and CTERA Experts. CTERA Search monitors the global file system for new or modified files and extracts their content using file-type-specific extractors. It then indexes these files into a hybrid full-text and vector database, supporting retrieval based on both data and metadata. This process, built on Kubernetes for scalability, continuously indexes data in the background, with discovery tools guiding focus to high-value folders. CTERA Classify builds upon this by labeling and enriching data, using LLMs and other AI models to extract specific schemas from unstructured text (e.g., patient names and dates from medical documents). This derived information is stored as searchable metadata and alternate data streams, which are readily accessible to agents. Finally, CTERA Experts serves as a reasoning layer: it lets agents answer natural-language questions by querying the underlying search and classification data, and it can act as a sub-agent within broader agentic platforms. This suite of content services focuses on data curation and preparation for AI and is provided as a privately hosted system, not a SaaS offering.
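To make the extract-then-index flow described for CTERA Search more concrete, here is a minimal Python sketch of a hybrid index that combines a keyword match with a vector similarity score and filters on metadata. It is not CTERA's API or implementation: `EXTRACTORS`, `HybridIndex`, `index_file`, `search`, and the `alpha` weight are hypothetical names invented for illustration, and the bag-of-words "embedding" is a toy stand-in for a real embedding model.

```python
import math
import os
import re
from collections import Counter

# Hypothetical file-type-specific extractors. A real system would also
# handle PDFs, Office documents, images, and so on.
EXTRACTORS = {
    ".txt": lambda raw: raw.decode("utf-8", errors="replace"),
    ".md": lambda raw: raw.decode("utf-8", errors="replace"),
}


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def embed(text):
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(tokenize(text))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class HybridIndex:
    """Illustrative in-memory index holding tokens, a vector, and metadata per file."""

    def __init__(self):
        self.docs = {}  # path -> (token set, vector, metadata dict)

    def index_file(self, path, raw, metadata=None):
        """Extract and index one file; return False if no extractor matches."""
        ext = os.path.splitext(path)[1].lower()
        extractor = EXTRACTORS.get(ext)
        if extractor is None:
            return False
        text = extractor(raw)
        self.docs[path] = (set(tokenize(text)), embed(text), dict(metadata or {}))
        return True

    def search(self, query, alpha=0.5, **filters):
        """Rank files by a weighted mix of keyword overlap and vector similarity.

        Keyword arguments in `filters` must match the file's metadata exactly.
        """
        q_tokens = set(tokenize(query))
        q_vec = embed(query)
        results = []
        for path, (tokens, vec, meta) in self.docs.items():
            if any(meta.get(k) != v for k, v in filters.items()):
                continue
            keyword = len(q_tokens & tokens) / len(q_tokens) if q_tokens else 0.0
            score = alpha * keyword + (1 - alpha) * cosine(q_vec, vec)
            if score > 0:
                results.append((path, score))
        return sorted(results, key=lambda r: r[1], reverse=True)


index = HybridIndex()
index.index_file("/hr/policy.txt", b"Vacation policy for employees", {"dept": "hr"})
index.index_file("/eng/design.md", b"Storage design notes", {"dept": "eng"})
hits = index.search("vacation policy")
```

In this sketch, `alpha` controls how much weight the keyword match gets relative to the vector score, and a call such as `index.search("design", dept="eng")` restricts results to files whose metadata matches. Metadata produced by a classification step could be attached through the same `metadata` argument.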
To further improve data accessibility and integration, CTERA introduced Fusion Direct, a solution for making existing object storage data AI-ready without migration or format changes. Fusion Direct connects directly to S3 buckets, providing file access while keeping the original objects as the gold copy. This creates a unified object and file view, which benefits HPC and AI training workloads and allows CTERA’s content services to enrich data directly within these buckets while taking advantage of caching. CTERA also announced a new connector for N8N, a low-code/no-code automation platform. This integration allows users to build visual workflows and pipelines that interact with CTERA data and use its content services, including enriched metadata, semantic search, and experts, directly from N8N’s ecosystem of connectors. These developments underscore CTERA’s strategy of providing versatile access and automation methods for AI-ready data.
Personnel: Aron Brand