Dreadnode
@dreadnode.bsky.social
📤 103
📥 16
📝 46
Building AI systems that advance the state of offensive security |
https://www.dreadnode.io/
Dreadnode is a proud sponsor of
@sentinelone.com
's
#labscon25
! Heading to Scottsdale this week? Catch
@machinavelli.com
and Brad Palm's talk, Auto-Poking the Bear—Analytical Tradecraft in the AI Age, on Thursday at 2pm MT. Or, shoot us a DM to find time to meet up onsite!
8 days ago
0
2
1
!!!
8 days ago
0
1
0
Incoming: Dreadnode paper drop from Shane Caldwell and the crew. PentestJudge—Judging Agent Behavior Against Operational Requirements:
arxiv.org/abs/2508.02921
Explore how we built an LLM-as-judge system for evaluating the operations of pentesting agents (inspired by PaperBench).
about 2 months ago
0
1
1
reposted by
Dreadnode
about 2 months ago
✍ After talking AI Action Plan on
@cyberscoop.bsky.social
, wrote up
@dreadnode.bsky.social
thoughts on implementation ➡️
dreadnode.io/blog/five-ta...
‼️ While we debate frameworks, adversaries build AI attack capabilities. We need: evaluation ecosystems, red teaming, and procurement standards.
0
0
1
In our latest blog, Shane Caldwell breaks down the process of creating a fully integrated, self-verifying agentic system that can do modern Windows Active Directory red team operations, without human interaction. Read it here:
dreadnode.io/blog/evals-t...
Evals: The Foundation for Autonomous Offensive Security
Learn how to build robust evaluations for autonomous red team agents that can perform Windows Active Directory operations. This blog covers action space design, programmatic verification, and measurin...
https://dreadnode.io/blog/evals-the-foundation-for-autonomous-offensive-security?utm_source=social&utm_medium=social&utm_campaign=shane_evals
about 2 months ago
0
2
1
Rise and shine! We're going live on Off By One with Stephen Sims this afternoon—meet us here at 11 AM PT:
www.youtube.com/live/BzOmGw-...
Building and Deploying Offensive Security Agents with Dreadnode
YouTube video by Off By One Security
https://www.youtube.com/live/BzOmGw-LaR0
2 months ago
0
0
0
In this edition of our From Compute to Congress policy blog series, Dreadnode Head of Policy Daria Bahrami explores how the TEST AI Act and red teaming standards can establish U.S. leadership in AI security:
dreadnode.io/blog/from-co...
From Compute to Congress: Setting the Global Standard for AI Security
Daria explores how the TEST AI Act and red teaming standards can establish American leadership in AI security—a winning policy roadmap from Critical Effect DC 2025.
https://dreadnode.io/blog/from-compute-to-congress-setting-the-global-standard-for-ai-security?utm_source=social&utm_medium=social&utm_campaign=compute_to_congress
3 months ago
1
2
0
Read
@rad-ads.bsky.social
's breakdown of Claude's attack sequence against the notoriously hard-to-solve "turtle" challenge:
dreadnode.io/blog/ai-red-...
AI Red Teaming Case Study: Claude 3.7 Sonnet Solves the Turtle Challenge
See how Claude solved a notoriously difficult AI/ML CTF challenge, going beyond pattern matching to genuine problem-solving under adversarial conditions.
https://dreadnode.io/blog/ai-red-teaming-case-study-claude-sonnet-solves-turtle?utm_source=social&utm_medium=social&utm_campaign=airtbench
3 months ago
0
0
0
Introducing AIRTBench, an AI red teaming benchmark for evaluating language models’ ability to autonomously discover and exploit AI/ML security vulnerabilities. Read the paper on arXiv:
arxiv.org/abs/2506.14682
Open-source dataset and benchmark eval code repo:
github.com/dreadnode/AI...
3 months ago
1
3
1
Check out
@machinavelli.com
's "Build with AI" Rigging workshop from
@pivotcon.bsky.social
:
github.com/vmsv/pivot20...
GitHub - vmsv/pivot2025-llmworkshop
Contribute to vmsv/pivot2025-llmworkshop development by creating an account on GitHub.
https://github.com/vmsv/pivot2025-llmworkshop
4 months ago
0
5
2
v3 of Rigging is out now. If you’re working with LLMs to build agents or run evaluations, check it out. We just added:
- Prompt caching for supported providers
- A unified tool system for function calling and fallbacks to XML/JSON parsing
- Native MCP integration
docs.dreadnode.io/open-source/...
https://docs.dreadnode.io/open-source/rigging
4 months ago
0
3
2
Introducing our new blog series: "From Compute to Congress: Decoding AI Policy" by Dreadnode Head of Policy Daria Bahrami | Read the first post here:
dreadnode.io/blog/from-co...
4 months ago
0
1
2
Are manual or automated attacks more effective when attacking LLMs? We found that automated approaches achieve significantly higher success rates (69.5%) compared to manual techniques (47.6%). More insights on LLM attack execution methods here 👉
dreadnode.io/blog/the-aut...
5 months ago
0
1
0
Strikes waitlist. Now open.
platform.dreadnode.io/waitlist/str...
[must have a Dreadnode account]
5 months ago
0
2
2
What's your take on the growing dominance of automated attacks and the implications for AI red teams? Here's ours, based on our analysis of 30 LLM challenges, attempted by 1,674 unique Crucible users, across 214,271 attack attempts:
arxiv.org/abs/2504.19855
5 months ago
0
4
6
@moohax.bsky.social
joins
@gregotto.bsky.social
on CyberScoop's Safe Mode podcast! Tune in at the 10-minute mark for a discussion on how AI fits into the offensive security narrative and what it means for tooling and defenses:
www.youtube.com/watch?v=ZReR...
Dreadnode CEO Will Pearce on the ever-changing field of offensive AI security
YouTube video by CyberScoop
https://www.youtube.com/watch?v=ZReRN5Jz6_0
5 months ago
0
1
0
Headed to RSA? Come meet the Dreadnode crew! Whether you're looking for a private deep dive into our tech or want to hang out and talk offensive AI research, we'd love to connect. Limited availability; come and get it:
calendly.com/tori-dreadno...
#BayArea
#SanFrancisco
#RSAC2025
#OffensiveAI
5 months ago
0
1
1
Hey, we know that guy! Catch Dreadnode's
@radads.bsky.social
on NASDAQ
#TradeTalks
alongside
@bugcrowd.com
CEO
@davegerryjr.bsky.social
and NFL CISO
@tomasmald.bsky.social
. Tune in for a candid conversation on the intersection of AI and cybersecurity:
www.nasdaq.com/videos/ever-...
https://www.nasdaq.com/videos/ever-changing-landscape-ai-safety
6 months ago
1
6
2
reposted by
Dreadnode
Martin Wendiggensen
6 months ago
Will be talking about
@dreadnode.bsky.social
’s great open-source Rigging repo and how to build your own LLM workflows! Super excited!
0
3
1
🌭🔪⚾️🦥🔥🔄🤨🛜 8 new Challenges now live in Crucible:
platform.dreadnode.io/crucible
These Challenges might look familiar… they first appeared at DEFCON 30 and were recently refactored for Crucible—enjoy! [Filter>Subject>DEFCON-30]
6 months ago
0
2
1
New blog: Dreadnode’s Policy Recommendations for the U.S. AI Action Plan. Our response focuses on two critical strategies:
1️⃣ Leveraging AI to protect America
2️⃣ Attacking AI to find its limits
Read our complete response on the Dreadnode blog:
dreadnode.io/blog/policy-...
Dreadnode’s Policy Recommendations for the U.S. AI Action Plan
Read Dreadnode’s AI policy recommendations for the U.S. AI Action Plan, which focuses on leveraging AI to protect America and attacking AI to find its limits.
https://dreadnode.io/blog/policy-recommendations-us-ai-action-plan
6 months ago
0
2
2
We're LIVE:
www.linkedin.com/events/lives...
Live Stream with Dreadnode Founders | LinkedIn
📅 Date: Wednesday, March 19, 2025 | ⏰ Time: 10 AM PT / 1 PM ET Dreadnode, the company at the forefront of offensive AI research and development, recently announced its Series A funding announcement ...
https://www.linkedin.com/events/livestreamwithdreadnodefounders7300049008461238272/theater/
6 months ago
0
2
1
Shoutout to these three Crucible users, who were first to solve this week's new Phantom Cheque Challenge. 👏👏👏
1. conor-99
2. Bilal
3. ken
Cheque it out:
platform.dreadnode.io/crucible/pha...
6 months ago
0
1
0
Cheque, check, one-two. We have a new Crucible Challenge for you: Phantom Cheque! Can you evade the cheque scanner and determine the areas of JagaLLM that need to be improved? Act fast; first three to solve this model extraction Challenge announced Friday:
platform.dreadnode.io/crucible/pha...
7 months ago
0
2
0
reposted by
Dreadnode
sina
7 months ago
can’t recommend
@dreadnode.bsky.social
enough - learning a lot going through the challenges and docs
0
2
1
In this week's new Crucible Challenge, find the hidden phrase in the backdoored model using dyana, an open source tool created by Dreadnode's Ads Dawson. Can you outwit the llamas?
platform.dreadnode.io/crucible/dya...
7 months ago
1
1
0
Big news from our crew today! We announced our $14M Series A funding led by Decibel with participation from Next Frontier Capital, In-Q-Tel (IQT), Sands Capital, and Indie VC and released two new solutions: Strikes and Spyglass. Read the announcement:
dreadnode.io/blog/series-...
Dreadnode Secures $14M to Build AI Systems that Advance the State of Offensive Security
Dreadnode Raises $14M to Advance Offensive Security | Series A Announcement
https://dreadnode.io/blog/series-a-funding-announcement
7 months ago
0
10
5
Raiders of the Lost AI: Attempt our new Crucible Challenge, Palimpsest! Decode the hidden message in the scroll, find the flag. First three to solve will be announced Friday, right here. Get started:
crucible.dreadnode.io/challenges/p...
7 months ago
1
2
0
Kudos to these individuals for killing this week’s Crucible Challenge. First three to solve Popcorn:
1️⃣ conor-99
2️⃣ garr
3️⃣ mejokim
Have you attempted Popcorn yet? Enter Crucible:
crucible.dreadnode.io/challenges/p...
7 months ago
0
3
0
@datasociety.bsky.social
and the AI Risk and Vulnerability Alliance just released “Red Teaming in the Public Interest,” a report examining how red teaming methods are being adapted to evaluate genAI. Read the report, featuring commentary from
@moohax.bsky.social
:
datasociety.net/library/red-...
Red-Teaming in the Public Interest
This report offers a vision for red-teaming in the public interest: a process that goes beyond system-centric testing of already built systems to consider the full range of ways the public can be invo...
https://datasociety.net/library/red-teaming-in-the-public-interest/
7 months ago
0
5
3
Boo! 👻 In our new Crucible Challenge, Popcorn, an LLM firewall is blocking access to a protected SQL table. Can you unmask the secret info? First-to-solve announced Friday. Get started:
crucible.dreadnode.io/challenges/p...
7 months ago
0
4
1
Another week, another new Crucible Challenge. Shoutout to these three for being the first to solve our reasoning model Challenge, DeepTweak! Get your tweak on:
crucible.dreadnode.io/challenges/d...
8 months ago
0
1
0
New to Rigging: 🔥 Tracing 🛠️ API Tools 💻 HTTP Generator 🐍 Prompts as Tools →
github.com/dreadnode/ri...
8 months ago
0
7
4
NEW Crucible Challenge: DeepTweak, an exploration of reasoning model behavior. Cause enough confusion 😵‍💫, retrieve the flag. Think fast; the first three users to solve DeepTweak will be announced Friday! ➡️
https://crucible.dreadnode.io/challenges/deeptweak?utm_source=social&utm_medium=social&u…
8 months ago
0
4
4
Congrats to these hosers for being the first three to solve the canadianeh challenge in Crucible! Tune in Tuesday for the next drop 👀 ICYMI, give canadianeh a try:
crucible.dreadnode.io/challenges/c...
8 months ago
0
2
0
Don't be a hozer eh. It's aboot time you started taking model security seriously. Head to Crucible to attempt our new Challenge, canadianeh. Can you be the first to solve it? Check back here Friday. Happy hacking:
https://buff.ly/4gn4hHP
8 months ago
0
4
1
Where in the world is Dreadnode? Catch our founders
@moohax.bsky.social
and Nick Landers at these upcoming AI security events:
💻 NEBULA:FOG:PRIME Hackathon (Saturday, January 25)
🇫🇷 Paris AI Security Forum 2025 (Sunday, February 9)
Shoot us a DM to link up!
8 months ago
0
2
1
NEW open source tool from Dreadnode's Simone Margaritelli and
@radads.bsky.social
: dyana, an eBPF sandbox environment designed to load, run, and profile a wide range of files and provide dynamic testing for AI models. You know the drill - try it out:
github.com/dreadnode/dy...
8 months ago
0
3
1
Who's going to
#shmoocon
this weekend?
9 months ago
0
2
0
Fantastic walkthrough of the "What's the flag #6" challenge from
@blaisebits.bsky.social
👇
9 months ago
0
1
0
Check out v0.4.0 of robopages! 🤖 New updates from Simone Margaritelli (@evilsocket) include: Support for executing commands on another host via SSH, easier integration into CI workflows, support for shared environment variables, and integrations with 13 new tools. —>
https://buff.ly/3VDDGPd
9 months ago
0
6
2
reposted by
Dreadnode
Greg Wells
10 months ago
For all the Burp fans: add some robot smarts to your web app testing
- Define scope of analysis
- Run requests/responses through your LLM of choice
- View labeled findings and vulns
As always, the Dreadnode team would love feedback!
0
5
1
Introducing burpference, a new Burp Suite extension from Dreadnode's
@radads.bsky.social
! Burpference was created to capture in-scope HTTP requests and responses from Burp Suite’s proxy history and ship them to a remote LLM API in JSON format. Try it out —
github.com/dreadnode/bu...
.
GitHub - dreadnode/burpference: A research project to add some brrrrrr to Burp
A research project to add some brrrrrr to Burp. Contribute to dreadnode/burpference development by creating an account on GitHub.
https://github.com/dreadnode/burpference
10 months ago
0
5
4
Dreadnode’s U.S. team: *Celebrates Thanksgiving* Our resident Canadian:
10 months ago
1
1
0
👋
10 months ago
0
2
0
Solid intro to adversarial machine learning attacks from Boschko ➡️
boschko.ca/adversarial-...
Breaking Down Adversarial Machine Learning Attacks Through Red Team Challenges
Learn how to craft and understand adversarial attacks on AI/ML models through hands-on challenges on Dreadnode’s Crucible CTF platform.
https://boschko.ca/adversarial-ml/
10 months ago
0
3
1