Natural Abstractions
Concept
The hypothesis that many different learning processes, trained on the same world, converge on the same "natural" abstractions. If true, advanced AI systems would internally use concepts recognizable to humans, which would make interpretability and alignment substantially easier.
Related
Safety Agendas
Interpretability
Approaches
Representation Engineering
Sleeper Agent Detection
AI-Assisted Alignment
Mechanistic Interpretability
Risks
Deceptive Alignment
Analysis
Model Organisms of Misalignment
Capability-Alignment Race Model
Safety Research
Anthropic Core Views
Organizations
Anthropic
Conjecture
Key Debates
AI Alignment Research Agendas
Technical AI Safety Research
Is Interpretability Sufficient for Safety?
Concepts
Dense Transformers
Historical
Deep Learning Revolution Era
Mainstream Era
Other
Dario Amodei
Yoshua Bengio
Chris Olah
Neel Nanda