Technology News | Slashdot

Mistral Adds a New API That Turns Any PDF Document Into an AI-Ready Markdown File 24

Posted by BeauHD on Friday March 07, 2025 @06:00AM from the next-big-leap dept.

Mistral has launched a new multimodal OCR API that converts complex PDF documents into AI-friendly Markdown files. The API is designed for efficiency, handles visual elements like illustrations, supports complex formatting such as mathematical expressions, and reportedly outperforms similar offerings from major competitors. TechCrunch reports: Unlike most OCR APIs, Mistral OCR is a multimodal API, meaning that it can detect when there are illustrations and photos intertwined with blocks of text. The OCR API creates bounding boxes around these graphical elements and includes them in the output. Mistral OCR also doesn't just output a big wall of text; the output is formatted in Markdown, a formatting syntax that developers use to add links, headers, and other formatting elements to a plain text file.

Mistral OCR is available on Mistral's own API platform or through its cloud partners (AWS, Azure, Google Cloud Vertex, etc.). And for companies working with classified or sensitive data, Mistral offers on-premise deployment. According to the Paris-based AI company, Mistral OCR performs better than APIs from Google, Microsoft, and OpenAI. The company has tested its OCR model with complex documents that include mathematical expressions (LaTeX formatting), advanced layouts, or tables. It is also supposed to perform better with non-English documents. [...]

Mistral is also using Mistral OCR for its own AI assistant Le Chat. When a user uploads a PDF file, the company uses Mistral OCR in the background to understand what's in the document before processing the text. Companies and developers will most likely use Mistral OCR with a RAG (aka Retrieval-Augmented Generation) system to use multimodal documents as input in an LLM. And there are many potential use cases. For instance, we could envisage law firms using it to help them swiftly plough through huge volumes of documents. "Over the years, organizations have accumulated numerous documents, often in PDF or slide formats, which are inaccessible to LLMs, particularly RAG systems. With Mistral OCR, our customers can now convert rich and complex documents into readable content in all languages," said Mistral co-founder and chief science officer Guillaume Lample.

"This is a crucial step toward the widespread adoption of AI assistants in companies that need to simplify access to their vast internal documentation," he added.

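Because the output described above is Markdown rather than a flat wall of text, a RAG pipeline can split it along its headings before indexing. The following is a minimal sketch of that idea, assuming the Markdown has already been produced by some OCR step; it does not call Mistral's API, and the function name `chunk_markdown` and the chunk format are hypothetical choices made for illustration.

```python
import re
from typing import Dict, List

# Matches ATX-style Markdown headings such as "# Title" or "### Section".
HEADING = re.compile(r"^(#{1,6})\s+(.*)$")


def chunk_markdown(markdown: str) -> List[Dict[str, str]]:
    """Split Markdown into one chunk per heading.

    Each chunk keeps its heading text as "title" so a retriever can show
    where a passage came from. Headings with no body text are skipped.
    """
    chunks: List[Dict[str, str]] = []
    title = ""  # text before the first heading gets an empty title
    body: List[str] = []

    def flush() -> None:
        text = "\n".join(body).strip()
        if text:
            chunks.append({"title": title, "text": text})

    for line in markdown.splitlines():
        match = HEADING.match(line)
        if match:
            flush()  # close the previous section
            title = match.group(2).strip()
            body = []
        else:
            body.append(line)
    flush()  # close the final section
    return chunks


if __name__ == "__main__":
    sample = (
        "# Intro\nHello world.\n\n"
        "## Method\nWe use $E = mc^2$.\n\n| a | b |\n|---|---|\n| 1 | 2 |\n"
    )
    for chunk in chunk_markdown(sample):
        print(chunk["title"], "->", len(chunk["text"]), "characters")
```

Splitting on headings keeps tables and LaTeX expressions inside the section they belong to, which is one plausible reason to prefer Markdown over raw text as an intermediate format. This sketch does not handle `#` lines inside fenced code blocks, which a production splitter would need to skip.
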
Gboard Testing Circle, Pill-Shaped Keys On Android (9to5google.com) 36

Posted by BeauHD on Thursday March 06, 2025 @08:50PM from the would-you-look-at-that dept.

US House Panel Subpoenas Alphabet Over Content Moderation (yahoo.com) 40

Posted by BeauHD on Thursday March 06, 2025 @07:30PM from the behind-the-scenes dept.

Eric Schmidt Argues Against a 'Manhattan Project for AGI' (techcrunch.com) 63

Posted by msmash on Thursday March 06, 2025 @09:08AM from the curb-your-enthusiasm dept.

Google is Adding More AI Overviews and a New 'AI Mode' To Search (theverge.com) 33

Posted by msmash on Wednesday March 05, 2025 @07:30PM from the aggressive-expansion dept.

Google Urges DOJ To Reverse Course on Breaking Up Company (yahoo.com) 86

Posted by msmash on Wednesday March 05, 2025 @11:00AM from the how-about-that dept.

Turing Award Winners Sound Alarm on Hasty AI Deployment (ft.com) 10

Posted by msmash on Wednesday March 05, 2025 @10:00AM from the more-warnings dept.

Google Releases SpeciesNet, an AI Model Designed To Identify Wildlife (techcrunch.com) 15

Posted by BeauHD on Tuesday March 04, 2025 @09:00AM from the AI-for-everything dept.

Google Play Is Going To Start Highlighting Apps With Widgets (theverge.com) 15

Posted by BeauHD on Monday March 03, 2025 @08:50PM from the widga-look-at-that dept.

Google Calls for Measurable Memory-Safety Standards for Software (googleblog.com) 44

Posted by EditorDavid on Saturday March 01, 2025 @11:34AM from the a-few-pointers dept.

Memory safety bugs are "eroding trust in technology and costing billions," argues a new post on Google's security blog — adding that "traditional approaches, like code auditing, fuzzing, and exploit mitigations — while helpful — haven't been enough to stem the tide."

So the blog post calls for a "common framework" for "defining specific, measurable criteria for achieving different levels of memory safety assurance." The hope is this gives policy makers "the technical foundation to craft effective policy initiatives and incentives promoting memory safety" leading to "a market in which vendors are incentivized to invest in memory safety." ("Customers will be empowered to recognize, demand, and reward safety.")

In January the same Google security researchers co-wrote an article noting there are now strong memory-safety "research technologies" that are sufficiently mature: memory-safe languages (including "safer language subsets like Safe Buffers for C++"), mathematically rigorous formal verification, software compartmentalization, and hardware and software protections. (Hardware protections include ARM's Memory Tagging Extension and the Capability Hardware Enhanced RISC Instructions, or "CHERI", architecture.) Google's security researchers are now calling for "a blueprint for a memory-safe future," though importantly, the idea is "defining the desired outcomes rather than locking ourselves into specific technologies."

Their blog post this week again urges a practical, actionable framework that is commonly understood but supports different approaches (and allows tailoring to specific needs) while enabling objective assessment:

At Google, we're not just advocating for standardization and a memory-safe future, we're actively working to build it. We are collaborating with industry and academic partners to develop potential standards, and our joint authorship of the recent CACM call-to-action marks an important first step in this process... This commitment is also reflected in our internal efforts. We are prioritizing memory-safe languages, and have already seen significant reductions in vulnerabilities by adopting languages like Rust in combination with existing, wide-spread usage of Java, Kotlin, and Go where performance constraints permit. We recognize that a complete transition to those languages will take time. That's why we're also investing in techniques to improve the safety of our existing C++ codebase by design, such as deploying hardened libc++.

This effort isn't about picking winners or dictating solutions. It's about creating a level playing field, empowering informed decision-making, and driving a virtuous cycle of security improvement... The journey towards memory safety requires a collective commitment to standardization. We need to build a future where memory safety is not an afterthought but a foundational principle, a future where the next generation inherits a digital world that is secure by design.

The security researchers' post calls for "a collective commitment" to eliminate memory-safety bugs, "anchored on secure-by-design practices..." One of the blog post's subheadings? "Let's build a memory-safe future together."

And they're urging changes "not just for ourselves but for the generations that follow."

Google's Taara Hopes To Usher in a New Era of Internet Powered by Light (wired.com) 20

Posted by msmash on Friday February 28, 2025 @10:10PM from the moonshots dept.

Microsoft Begins Turning Off uBlock Origin, Other Extensions In Edge (neowin.net) 73

Posted by BeauHD on Friday February 28, 2025 @08:30PM from the it's-happening dept.

Google's Sergey Brin Urges Workers To the Office at Least Every Weekday 140

Posted by msmash on Friday February 28, 2025 @02:50PM from the clocking-the-hours dept.

DeepMind CEO Says AGI Definition Has Been 'Watered Down' (bloomberg.com) 42

Posted by msmash on Friday February 28, 2025 @01:33PM from the closer-look dept.

Google Tweak Creates Crisis for Product-Review Sites (wsj.com) 27

Posted by msmash on Friday February 28, 2025 @11:20AM from the closer-look dept.

Apple Launches 'Age Assurance' Tech As US States Mull Social Media Laws (reuters.com) 53

Posted by BeauHD on Thursday February 27, 2025 @09:25PM from the digital-bouncers dept.

Thousands of Exposed GitHub Repositories, Now Private, Can Still Be Accessed Through Copilot (techcrunch.com) 19

Posted by BeauHD on Thursday February 27, 2025 @06:40PM from the PSA dept.

The New York City Subway Is Using Google Pixels To Listen for Track Defects (wired.com) 23

Posted by msmash on Thursday February 27, 2025 @01:21PM from the moving-forward dept.

Pixel Watch 3 Gets FDA Clearance For Loss of Pulse Alerts 30

Posted by BeauHD on Wednesday February 26, 2025 @09:00PM from the dying-alerts dept.

Google Is Making It Easier To Remove Personal Info On Search (engadget.com) 6

Posted by BeauHD on Wednesday February 26, 2025 @06:20PM from the new-and-improved dept.
