July 2024
In the following google doc., we categorize and summarize recent papers on the security and safety threats/risks of LLMs. Note that there are already some awosome SoK papers about LLM security and safety with large scale evaluations. This doc. is less formal but provides a structured view of papers in different categories. We use this doc. as a literature review and a paper tracker. To facilitate future research, we also provide a short discussion about potential research directions under each risk category. We will try our best to keep updating the doc. If you are interested in research along this direction and want to nail down a concrete project, check it out~
We use Google doc. rather than other (fancier) tools mainly because we use it to organize our research projects. Sorry if you feel it is a little bit old school. If you find this doc. useful, you can also consider citing it in your research papers, which will be much appreciated~ Thank you!
@article{guo2024llmSec, title = {LLM safety and security: taxonomy, status, and future}, author = {Guo, Wenbo and Nie, Yuzhou and Chen, Xuan}, journal = {henrygwb.github.io}, year = {2024}, url = {https://henrygwb.github.io/posts/llm_security.hml/} }