If we think new model has important public safety or security implications, we add it to the tracker. New entries usually introduce a capability that hadn’t previously existed, or represent the proliferation of a flagged capability of concern.
Each entry includes our best assessment of the model’s scale (in terms of number of parameters, dataset size, and total FLOPs of compute), a short description of the model and its capabilities, some industry context, and other information that we think helps paint a picture of the model’s significance to public safety.
Our methodology is constantly evolving. If you believe we’re omitting useful information or have any suggestions for us, please submit a correction above, or email us at firstname.lastname@example.org.