🔍 Search

Open
⚡️OpenAI launches Aardvark, a GPT-5 agent for code security

⚡️OpenAI launches Aardvark, a GPT-5 agent for code security

Microsoft's Copilot access web, OpenAI offers ChatGPT credits, Gemini discounts, Moonshot's new model, Perplexity launc

Signup  |  Work With Us  |  Follow on X  | Read on Web

alpha_signal_image_1

Hey James

Welcome to AlphaSignal, the most read source of news by AI engineers and researchers.


Every day, we identify and summarize the top 1% of news, papers, models, and repos, so you're always up to date.


Here's today's roundup:

Summary

Read time: 4 min 38 sec

Top News

OpenAI launches Aardvark, a GPT-5 agent that hunts and fixes code bugs

you.com

Discover how to prioritize and optimize AI investments using You.com's guide

Signals

 Microsoft introduces Copilot update to access and summarize web content

 OpenAI announces ChatGPT credits for flexible Codex and Sora use

 Google increases Gemini 2.5 caching discount from 75% to 90%

 Moonshot AI releases new open-source faster attention architecture

 Perplexity unveils AI-driven tool to make global patent research accessible

Trending Papers

New "Memory Folding" tech gives DeepAgent brain-like reasoning powers

Alibaba introduces AgentFold, a dynamic memory system for web agents

Google's VISTA teaches video models to improve themselves over time

How To

Train SmolLM3: The complete Hugging Face training guide

Top News

OpenAI introduces Aardvark, a GPT-5-powered agent that scans code and proposes verified security fixes

OpenRouter releases Sonoma Dusk Alpha with 2M token context

alpha_signal_image_2

Every big software project hides a few nasty surprises. Vulnerabilities sneak in through small commits, unnoticed until they break something important or worse, expose data.


Security tools can help, but they often drown engineers in false alarms or miss logic errors that only a human reviewer would spot.


That's the problem Aardvark takes on. Built by OpenAI and powered by GPT-5, Aardvark acts like an autonomous security researcher that scans your repositories, understands code behavior, and proposes fixes.


It doesn't use traditional methods like fuzzing or static analysis. Instead, it reads, reasons, and tests code the way a real engineer would.


How it works
Once connected to your GitHub repository, Aardvark runs a full scan to understand your system and build a threat model like a map of what's important to protect. It then keeps watch as you push new commits.

  • Flags suspicious code changes in real time.

  • Tests each finding in a sandbox to confirm it's exploitable.

  • Generates a Codex-based patch for human review.

  • Submits pull requests directly into your workflow.

Performance and results
OpenAI reports that Aardvark identified 92% of known and synthetic vulnerabilities in benchmark repositories. It has already uncovered and disclosed ten real CVEs in open-source projects.


Who can use it
Right now, Aardvark runs in private beta for select enterprise and open-source partners. You can integrate it with GitHub or CI/CD pipelines, giving it repository access to scan code, propose fixes, and annotate findings.


Aardvark isn't just a vulnerability scanner. It's a new kind of teammate: an AI that reads your codebase like a researcher, tests its own findings, and learns how to make your next commit safer.

TRY NOW
alpha_signal_image_3

Find out how to make every AI investment count

AI is all the rage, but are you using it to your advantage? Find out how to make every AI investment count. We love this practical guide from You.com that walks through a proven framework to identify and prioritize high-value AI opportunities.


You'll learn how to:

  • Map internal workflows and customer journeys to pinpoint where AI can drive measurable ROI

  • Ask the right questions when it comes to AI use cases

  • Align cross-functional teams and stakeholders for a unified, scalable approach

Learn more with this AI Use Case Discovery Guide.

DOWNLOAD GUIDE NOW

partner with us

Signals

Microsoft launches Researcher with Computer Use, letting Copilot browse, navigate, and act securely online

1,167 Likes

OpenAI adds a new credit system to extend Codex and Sora access for Plus and Pro users after limits are reached

3,378 Likes

Google raises Gemini 2.5 caching discount to 90%, reducing input token costs for repeated prompts

1,649 Likes

Moonshot AI unveils Kimi Linear, an open-source linear attention model achieving 6× decoding speed on long-context tasks 

993 Likes

Perplexity debuts Patents, an AI research agent with direct access to global patent databases

901 Likes

Papers

New "Memory Folding" tech gives DeepAgent brain-like reasoning powers

1,849 Likes

The method bridges reinforcement learning i.e., on-policy and supervised distillation which is off-policy. It uses reverse-KL per-token loss, training students on what they actually produce, reducing compounding errors and enabling smaller models to perform like larger ones.

Alibaba introduces AgentFold, a dynamic memory system for web agents

1,382 Likes

AgentFold replaces passive history logs with active context management. It selectively compresses past steps to retain key details, surpassing DeepSeek-V3.1-671B and OpenAI o4-mini using only supervised fine-tuning.

Google's VISTA teaches video models to improve themselves over time

1,584 Likes

Google and NUS introduce VISTA, a self-improving video generation agent that iteratively refines prompts through multi-agent feedback. It improvs video quality and alignment by 60% over state-of-the-art baselines.


How To

Train SmolLM3: The Complete Hugging Face Training Guide

alpha_signal_image_4

This guide walks through the full process of training SmolLM3, using Hugging Face's new Smol Training Playbook. It explains how to design, run, and analyze pretraining and post-training experiments for language models.


The playbook closes the trilogy that started with FineWeb for dataset creation and Ultrascale for infrastructure scaling.


You'll learn how to:

  • Prepare and curate datasets for model pretraining

  • Configure and launch distributed training across GPU clusters

  • Tune hyperparameters systematically for optimal results

  • Apply post-training techniques to boost model quality

  • Log, compare, and analyze experiments for reproducibility

  • Manage scaling challenges and efficiency trade-offs in training

LEARN MORE

Post a Comment

0 Comments

Users_Online! 🟢

FOUNDER/AUTHOR

FOUNDER/AUTHOR VHAVENDA I.T SOLUTIONS