UPDATED 12:00 EST / JUNE 03 2025

AI

Vertesia launches document preparation service to increase AI reliability and accelerate app development

Vertesia Inc., a unified low-code platform for developing and deploying custom generative artificial intelligence applications, today announced the launch of a new semantic document preparation service it says will increase the reliability of AI applications and speed up development.

Vertesia provides a cloud-based application programming interface service that prepares underlying data for use by generative AI models, ensuring output accuracy. According to the company’s own research, up to 50% of the development time spent on generative AI applications is dedicated to document preparation.

The new semantic document preparation service is designed to ease this process and provide developers a rich context for large language models to work with, which Vertesia claims can “eliminate” generative AI hallucinations.

A hallucination is an error where an LLM generates an incorrect or false answer that it states confidently. The causes can be numerous, including training data issues, inherent limitations or challenges in understanding nuanced language or context such as incomplete or noisy data.

“The two concerns we hear most from enterprise leaders are consistent: 95% accuracy isn’t good enough and data preparation is a costly, time-consuming challenge,” said Chief Revenue Officer Chris McLaughlin. “Our Semantic DocPrep service was built to solve both — giving developers a set of APIs to automate document preparation and significantly improve the accuracy and relevancy of LLM outputs.”

The company said using its preparation service, it can convert even the most complex documents, such as reports and regulatory filings, into richly structured, semantically tagged XML. It will do this without rewriting or altering the source. Since the process preserves the original structure, relationships and context, it ensures that the LLM can understand the document without misinterpreting the information, which greatly increases the accuracy of responses.

This document transformation method is designed for developers building custom generative AI applications and retrieval-augmented generation pipelines, also known as RAG, which are used to enhance the accuracy of generative AI apps with real-time data.

The company said its data transformation engine deconstructs documents at the page level and uses the most appropriate AI model based on the content: dense text, tabular data, images or a mix. It will either use LLMs, optical character recognition or vision models. By using this hybrid model, it avoids rewrites to maintain consistency and preserve the original text and generate high-fidelity XML outputs.

The service is accessible via an API, which can be combined directly into a development pipeline. This allows developers to send documents for preparation and receive XML outputs ready for chunking, indexing and model ingestion. No setup or model training is required.

The new Semantic DocPrep is part of the company’s already existing platform, which provides infrastructure for organizations who are looking to build, deploy and manage custom generative AI applications and agents at scale.

Photo: Annie Spratt/Unsplash

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

  • 15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
  • 11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.
About SiliconANGLE Media
SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.