LLMs for CyberSecurity¶
LLMs for CyberSecurity References¶
- Generative AI and Large Language Models for Cyber Security: All Insights You Need, May 2024
- A Comprehensive Review of Large Language Models in Cyber Security, September 2024
- Large Language Models in Cybersecurity: State-of-the-Art, January 2024
- How Large Language Models Are Reshaping the Cybersecurity Landscape | Global AI Symposium talk, September 2024
- Large Language Models for Cyber Security: A Systematic Literature Review, July 2024
- Using AI for Offensive Security, June 2024
Agents for CyberSecurity References¶
- Blueprint for AI Agents in Cybersecurity - Leveraging AI Agents to Evolve Cybersecurity Practices
- Building AI Agents: Lessons Learned over the past Year
Comparing LLMs¶
There are several sites that allow comparisons of LLMs e.g.
- https://artificialanalysis.ai/
- Independent analysis of AI models and API providers. Understand the AI landscape to choose the best model and provider for your use-case
- https://llmpricecheck.com/
- Compare and calculate the latest prices for LLM (Large Language Models) APIs from leading providers such as OpenAI GPT-4, Anthropic Claude, Google Gemini, Mate Llama 3, and more. Use our streamlined LLM Price Check tool to start optimizing your AI budget efficiently today!
- https://openrouter.ai/rankings?view=day
- Compare models used via OpenRouter
- https://github.com/vectara/hallucination-leaderboard
- LLM Hallucination Rate leaderboard
- https://lmarena.ai/?leaderboard
- Chatbot Arena is an open platform for crowdsourced AI benchmarking
- https://aider.chat/docs/leaderboards/
- Benchmark to evaluate an LLM’s ability to follow instructions and edit code successfully without human intervention
- https://huggingface.co/spaces/TIGER-Lab/MMLU-Pro
- Benchmark to evaluate language understanding models across broader and more challenging tasks
See also Economics of LLMs: Evaluations vs Pricing - Looking at which model to use for which task