Update README.md
README.md
CHANGED
@@ -279,8 +279,8 @@ We also selected three authoritative datasets containing benign samples:
 
 + **fka/awesome-chatgpt-prompts**[<sup>[awesome]</sup>](#awesome): 203 English samples
 + **StrongReject-Benign**[<sup>[Chi2024]</sup>](#Chi2024): 3,800 English samples, the benign portion of StrongReject
-+ **COIG-
-> COIG-
++ **COIG-CQIA**[<sup>[CQIA]</sup>](#CQIA): 44,694 Chinese samples
+> COIG-CQIA (Chinese Open Instruction Generalist - *Quality is All You Need*) is an open-source, high-quality instruction tuning dataset designed to support human-aligned interactions in the Chinese NLP community.
 
 By merging the six datasets listed above, we constructed a comprehensive test set to evaluate the effectiveness of the proposed method on third-party data. This test set contains:
 