§ 00 · Research
AI systems,
AI systems,
stress-tested.
Research-driven work exploring safety, reliability, benchmarking, and behavior under stress in large-scale software and AI systems.
§ 01
10 entries
Selected entries
LLM-CODEGEN
Advanced code generation system using Large Language Models.
Large-Scale Bug Prediction
Benchmarking bug prediction across 700k Python projects.
AI Recommender
Intelligent recommendation engine for personalized content.
AI in Software Engineering
Research on applying AI techniques to software engineering problems.
Research Paper
Academic research paper repository.
VLM Failure Modes
Analysis of failure modes in Vision-Language Models.
VLM Adversarial Defense
Defense mechanisms against adversarial attacks on VLMs.
Amnesic VLM Defense
Amnesic defense techniques for Vision-Language Models.
AI Impact on Jobs
Analysis of AI's impact on the job market.
Sentiment Steering GPT
Steering GPT output sentiment.