Why we invested in Anyformat
Backing AnyFormat to transform Data Processing with GenAI
At Abac Nest Ventures, we support technologies that drive meaningful change in business operations. AnyFormat, a Madrid-based GenAI platform, redefines how companies approach unstructured data. From PDFs and images to audio files, unstructured data is a potential goldmine for insights—but extracting these efficiently has long been a challenge. We’re excited to join AnyFormat’s €520k pre-seed round to support their journey in solving this critical problem for businesses.

The Challenge: 80% of Business Data Is Untapped
In today’s data-driven economy, approximately 80% of a company’s information assets are unstructured, often buried inside documents like spreadsheets, PDFs, and files used alongside ERP systems and other technologies. Despite advancements, businesses still rely on these side documents, making capturing and leveraging all their data challenging. This unstructured data holds valuable insights for decision-making across finance, healthcare, and insurance industries. Yet traditional extraction methods are often too costly, inefficient, and complex, leaving much of this data untapped. AnyFormat bridges this gap by automating the extraction and structuring of unstructured data, making it accessible, actionable, and cost-effective.
AnyFormat’s Solution: A GenAI Platform for Unstructured Data
With advanced GenAI capabilities, AnyFormat transforms unstructured data into structured, actionable insights, enabling businesses to make faster, smarter decisions. Built on a foundation of advanced AI models, the platform can handle a variety of data formats and integrates seamlessly with enterprise systems, prioritizing data privacy and cost efficiency.
The Platform: AnyFormat’s tool enables users to configure data extractions easily. Users can create, review, or delete extraction configurations, download extracted data, or publish an API to process files programmatically. The platform effortlessly scales from single files to millions, with AnyFormat managing the complexity of a full GenAI-powered extraction pipeline in the background.
The Processing Engine: AnyFormat’s core technology leverages foundational models such as OpenAI, Anthropic, and others, depending on the required skill sets. This "engine and bodywork" approach allows AnyFormat to pull from best-in-class models for optimal performance. On top of these foundational models, AnyFormat layers its proprietary models to fine-tune and train the data extraction pipeline with precision and efficiency. The engine is organized into five pillars:
Integrations: Flexible data input via APIs, cloud storage, and more.
Data Transformations: Standardizes complex data using a combination of classical OCR algorithms, vision models, content classifiers, and Retrieval-Augmented Generation (RAG).
Data Extractions: Fine-tuned Small Language Models (SLMs) ensure high-accuracy extraction which can be tailored to each client’s needs.
Data Validations: AI-driven validation checks data quality, including schema validation and reliability scores.
Data Exports: Delivers structured data in formats ready for integration, such as CSV, JSON, and SQL.
Each pillar is designed to handle diverse data types and sources, transform them as required, accurately extract relevant information, validate data integrity, and export it in the desired format. This flexible architecture makes AnyFormat a powerful comprehensive, scalable data processing tool. AnyFormat’s lightweight, secure SLMs also support local data processing, which reduces costs and enhances privacy.
The Founders: Firsthand AI Experience with a Vision for Change
AnyFormat’s founding team brings exceptional expertise to the GenAI landscape. CEO Juan Huguet-García holds a Ph.D. in Nuclear Physics and has years of hands-on experience developing and implementing GenAI solutions. Alongside co-founders Alejandro Fernández and Diego Pérez-Sastre, who each bring deep technical skills from their work in Clarity AI’s GenAI division, the team has a firsthand understanding of the challenges around unstructured data processing. Together, they designed AnyFormat to address the data quality, privacy, and usability gaps they encountered in previous roles. This highly qualified team is well-equipped to push the boundaries of what’s possible in data automation. You can hear more about Juan’s perspective on unstructured data in his recent Itnig Podcast interview.
A Market Primed for Transformation
The Intelligent Document Processing (IDP) market is projected to reach $20 billion by 2032, growing at a 30% CAGR. This shift underscores the importance of data automation for scaling operations and improving efficiency. AnyFormat has already launched pilot programs with clients in high-demand sectors like insurance and travel, proving its ability to streamline workflows and enhance decision-making. With a growing market demand and a scalable, efficient solution, AnyFormat is well-positioned for significant impact.
Our Decision: Investing in the Future of Data Automation
The vast amount of unstructured data generated across industries has created a pressing need for tools to convert raw information into valuable insights. GenAI now offers the power to transform this data into actionable knowledge, paving the way for innovation and competitive advantages across sectors. We believe AnyFormat is the team positioned to lead this transformation. With a flexible, efficient platform and deep expertise in AI, AnyFormat enables companies to unlock the value of unstructured data. This vision of data automation as a driver of industry-wide impact is why we proudly support AnyFormat in their €520k pre-seed round.
For more insights into AnyFormat’s mission and technology, see recent features in EU-Startups and Yahoo Finance or visit their website at anyformat.ai.
Follow Juan Huguet, Anyformat’s CEO on LinkedIn, where he regularly discusses recent developments in GenAI.



