Google releases Magika 1.0, an open-source AI-powered document inspection tool: fully migrated to Rust.

Google releases Magika 1.0, an open-source AI-powered document inspection tool: fully migrated to Rust.


On November 7th, local time, Thursday, Google announced the release of Magika 1.0, the first stable version of its AI-based file type detection system. Refactored in Rust for improved speed and memory safety, Magika has been widely adopted in the open-source community since its open-source release early last year, with over 1 million downloads per month. This update brings a completely new architecture, performance improvements, and support for more file types.

As mentioned earlier, the biggest change in Magika 1.0 is that its core engine has been completely rewritten in Rust for enhanced performance and memory safety. Additionally, the new Magika provides native Rust command-line tools, capable of identifying hundreds of files per second on a single core and scaling to thousands per second on multi-core CPUs.

The system uses the ONNX Runtime for model inference and leverages the Tokio framework for asynchronous parallel processing. Google's test data shows that on a MacBook Pro (M4), Magika can process approximately 1,000 files per second. Regarding file type support, Magika 1.0 expands its detection capabilities to over 200 file formats, double the number of the initial version. New categories include:

Data Science and Machine Learning: Supports Jupyter Notebooks (ipynb), Numpy (npy, npz), PyTorch (pytorch), ONNX (onnx), Apache Parquet (parquet), and HDF5 (h5) files;

Modern Programming and Web Development: Adds support for Swift, Kotlin, TypeScript, Dart, Solidity, WebAssembly (wasm), and Zig;

DevOps and Configuration Files: Supports Dockerfile, TOML, HashiCorp HCL, Bazel build files, and YARA rules;

Databases and Graphics Formats: Adds support for SQLite, AutoCAD (dwg, dxf), Photoshop (psd), and modern web fonts (woff, woff2).

Magika 1.0 also improves its ability to distinguish similar formats, such as JSONL vs. JSON, TSV vs. CSV, Apple binary plist vs. XML plist, and distinguishing between C vs. C++, JavaScript vs. TypeScript files.

Technically, the team faced two major challenges: the massive scale of the training data and the scarcity of samples for some file types. The uncompressed dataset exceeded 3TB, so Google used its self-developed SedPack dataset library, employing streaming loading and decompression techniques for efficient training. Simultaneously, for file types with insufficient samples, the research team used the generative AI tool Gemini to create high-quality synthetic training data, converting existing code and structured files into other formats to enhance the model's generalization ability.

The new version of Magika also updated the Python and TypeScript modules, simplifying the integration process for developers across different languages. Users can install the native client on Linux, macOS, or Windows via simple commands, or install the Python package using `pipx install magika` to use the Rust command-line tool. Google stated that Magika's future development will continue to focus on performance optimization and file type expansion. The team encourages the developer community to contribute, including through testing, feature requests, and code submissions.

Tesla officially announced that its third-generation Optimus humanoid robot production line will be completed and put into operation in 2026.

On November 7th, Tesla officially announced that the pilot production line for its Optimus humanoid robot has begun operating at its Fremont factory, with a la

Tesla officially announced that its third-generation Optimus humanoid robot production line will be completed and put into operation in 2026.

Google releases Magika 1.0, an open-source AI-powered document inspection tool: fully migrated to Rust.

On November 7th, local time, Thursday, Google announced the release of Magika 1.0, the first stable version of its AI-based file type detection system. Refacto

Google releases Magika 1.0, an open-source AI-powered document inspection tool: fully migrated to Rust.

Musk personally created the AI ​​chatbot Ani: Innovation and controversy coexist.

According to a recent report in *The Wall Street Journal*, Tesla and SpaceX CEO Elon Musk has once again demonstrated his focus on specific projects, this tim

Musk personally created the AI ​​chatbot Ani: Innovation and controversy coexist.

Vision Pro immersive content is poised for explosive growth; Apple unveils its 8K video production workflow for the first time.

At its recent "Create Immersive" event, Apple revealed for the first time its detailed production process for the "Apple Immersive Video" format designed for

Vision Pro immersive content is poised for explosive growth; Apple unveils its 8K video production workflow for the first time.

Siri's most powerful external support revealed: Apple plans to pay $1 billion annually to integrate Google Gemini.

Bloomberg's Mark Gurman published a blog post yesterday (November 5th) revealing that Apple is considering paying Google approximately $1 billion annually

Siri's most powerful external support revealed: Apple plans to pay $1 billion annually to integrate Google Gemini.

Google reveals why its AI-powered image model is called Nano Banana: initially just a placeholder, it became a mainstream product after its explosive popularity.

On November 7th, it was reported that Google officially revealed the origin of the name "Nano Banana" for its AI image model: it was originally just a placeho

Google reveals why its AI-powered image model is called Nano Banana: initially just a placeholder, it became a mainstream product after its explosive popularity.

Musk announced that the Tesla Roadster 2 sports car will be unveiled on April 1st next year.

On November 7th, reports indicated that Elon Musk stated at Tesla's annual shareholder meeting that the Tesla Roadster 2 sports car will be unveiled on Apr

Musk announced that the Tesla Roadster 2 sports car will be unveiled on April 1st next year.

Amazon launches Kindle Translate AI translation service: one-click translation of author's books.

On November 7th, Amazon announced the launch of Kindle Translate, an AI-powered translation service for its Kindle e-reader platform. Designed specifically for

Amazon launches Kindle Translate AI translation service: one-click translation of author's books.

Comments from an OpenAI executive triggered market volatility, with the Nasdaq falling nearly 2%.

U.S. tech stocks collectively declined overnight and this morning, with the Nasdaq Composite Index falling 1.9%, the S&P 500 Index falling 1.12%, and the Dow

Comments from an OpenAI executive triggered market volatility, with the Nasdaq falling nearly 2%.

Apple urges iPhone 13 and 14 users to upgrade.

Recently, Apple's official WeChat account used the tagline "1314, why not get the 17 now?" to directly urge iPhone 13 Pro and iPhone 14 Pro users to upgra

Apple urges iPhone 13 and 14 users to upgrade.

The Dark Side of the Moon launches Kimi K2 Thinking, the most powerful open-source thinking model, with multiple capabilities reaching state-of-the-art (SOTA) levels.

Last night, Dark Side of the Moon officially released its new generation open-source thinking model, Kimi K2 Thinking. Trained based on the "model as Agent" c

The Dark Side of the Moon launches Kimi K2 Thinking, the most powerful open-source thinking model, with multiple capabilities reaching state-of-the-art (SOTA) levels.

Google launches Project Suncatcher research program to take TPU AI chips into space.

Google officially launched Project Suncatcher on April 4th (US time), another disruptive and risky innovation project aiming for "moonshots" (a metaphor for a

Google launches Project Suncatcher research program to take TPU AI chips into space.

Reports suggest that Apple's M5 Ultra chip will debut in 2026, with the Mac Studio set to be the first to feature it.

Bloomberg's Mark Gurman published a blog post today (November 5th) revealing that Apple plans to launch the M5 Ultra chip in 2026, initially for the new M

Reports suggest that Apple's M5 Ultra chip will debut in 2026, with the Mac Studio set to be the first to feature it.

Fixed: Microsoft acknowledges a bug in Windows 10 that prompted users to upgrade to Windows 11 via an error pop-up.

Reports indicate that Microsoft has acknowledged a bug in Windows 10 that caused many computers still under support to incorrectly display the message "Your v

Fixed: Microsoft acknowledges a bug in Windows 10 that prompted users to upgrade to Windows 11 via an error pop-up.

Sony's backend code suggests that cross-platform purchasing between PS5 and PC may be possible in the future.

Data miner Amethxst has discovered new hidden system icons in the PlayStation backend, suggesting Sony may be bringing "cross-platform purchase" support to th

Sony's backend code suggests that cross-platform purchasing between PS5 and PC may be possible in the future.