Proud to announce:
💫 garak - an LLM vulnerability scanner💫
🔎 Check if a model is susceptible to common attacks
🦜 Supports HuggingFace, OpenAI, ggml, Cohere, ...
🔧 >70 probes: prompt injection, false claims, toxicity, encoding evasion, ..
ChatGPT not best at many language tasks. It's outranked by other systems on many NLP benchmarks in current evaluation. For 77.5% of tasks examined, other systems are better than ChatGPT.
opensamizdat.com/posts/chatgpt_…
Is Cosine-Similarity of Embeddings Really About Similarity?
Netflix cautions against blindly using cosine similarity as a measure of semantic similarity between learned embeddings, as it can yield arbitrary and meaningless results.
📝arxiv.org/abs/2403.05440
ELIZA designer Joseph Weizenbaum observed: “What I had not realized is that extremely short exposures to a relatively simple computer program could induce powerful delusional thinking in quite normal people.”