AI Control
Redwood Research › Research Areas › ai-control
Record Metadata
| Record Key | ai-control |
| Entity | Redwood Research |
| Collection | Research Areas(2 records total) |
| Schema | Major research initiatives and focus areas. |
| YAML File | packages/kb/data/things/dwMzc9WzPa.yaml |
Fields
| Name | AI Control |
| Description | Developing techniques to safely deploy AI systems even if they are not fully aligned |
| Started | Jan 2023 |
| Key Publication | arxiv.org↗ |
| Notes | Key paper on AI control; argues control is a complementary approach to alignment |
Other Records in Research Areas (1)
| Key | Name | Description | Started |
|---|
| adversarial-robustness | Adversarial Robustness | Testing and improving robustness of AI safety techniques against adversarial inputs | Jun 2021 |