Blog Posts | Joshua Berkowitz

3 Articles

Filters: 2025, GPU

FAISS, Up Close: Fast Similarity Search For The Vector Age

Every modern AI product has one quiet workhorse: finding the nearest neighbors of a vector fast. FAISS is the library many of us reach for when the dataset gets large and latency matters. Built at Met...

ANN, cuVS, faiss, GPU, Meta AI, vector search

Aug 29, 2025

Github Repos

BitNet: 1-bit LLMs Land With Practical Inference on CPUs and GPUs

BitNet from Microsoft Research is the official C++ inference stack for native 1-bit large language models, centered on BitNet b1.58. The repo ships fast, lossless ternary kernels for CPUs, a CUDA W2A8...

1-bit LLM, BitNet, CPU, GGUF, GPU, inference, llama.cpp, quantization, T-MAC

Aug 27, 2025

Github Repos

Mistral Compute: Democratizing Advanced AI Infrastructure for Everyone

What if anyone, from startups to nations, could access the powerful infrastructure needed to build next-generation AI? Mistral AI is making this a reality with Mistral Compute, a platform designed to...

AI infrastructure, cloud computing, data sovereignty, enterprise AI, Europe, GPU, open science, sustainability

Jun 17, 2025

News
