What is PKU-SafeRLHF?
Trust Score: 57/100 (D) · ⚠️ Use Caution
PKU-SafeRLHF is a dataset published by PKU-Alignment (PKU-Alignment/PKU-SafeRLHF). It has a Nerq Trust Score of 57/100 (D) and 176 GitHub stars. Last analyzed April 2026.
Why This Score
- ⚠️ Security: 0/100 — Some security concerns
- ⚠️ Maintenance: 0/100 — Maintenance activity is low
- ⚠️ Community: 176 stars, 8.5K downloads — Growing community
- ⚠️ Transparency: No license specified
Trust & Safety Overview
| Trust Score | Grade | Stars | Downloads |
|---|---|---|---|
| 57 | D | 176 | 8.5K |
What PKU-SafeRLHF Does
PKU-SafeRLHF (PKU-Alignment/PKU-SafeRLHF) is a dataset listed in the AI tool category. It is published by PKU-Alignment and has no specified license. With 176 GitHub stars and 8.5K downloads, it has a growing community of users and contributors.
Who Should Use PKU-SafeRLHF
PKU-SafeRLHF is suitable for evaluation and non-critical use. Review the trust score breakdown before using it in production.
Details
| Field | Value |
|---|---|
| Author | PKU-Alignment |
| Category | AI tool |
| License | Not specified |
| Type | dataset |
| Source | View on GitHub |
| Security Score | 0/100 |
| Activity Score | 0/100 |
How to Get Started
Check the trust score before installing:
curl "nerq.ai/v1/preflight?target=pku-saferlhf"
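The same preflight request can be sketched in Python for use in a script. This is a minimal sketch under stated assumptions: the `https://` scheme is assumed (the command above shows no scheme), the helper names `preflight_url` and `fetch_preflight` are hypothetical, and because the response format is not described here, the body is returned as raw text rather than parsed.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: the endpoint is served over HTTPS; the source shows no scheme.
PREFLIGHT_BASE = "https://nerq.ai/v1/preflight"


def preflight_url(target: str, base: str = PREFLIGHT_BASE) -> str:
    """Build the preflight URL with the target safely encoded as a query parameter."""
    return f"{base}?{urlencode({'target': target})}"


def fetch_preflight(target: str, timeout: float = 10.0) -> str:
    """Return the raw response body as text; no response schema is assumed."""
    with urlopen(preflight_url(target), timeout=timeout) as resp:
        return resp.read().decode("utf-8", errors="replace")
```

Calling `fetch_preflight("pku-saferlhf")` performs the network request and returns whatever the service sends back. Building the URL in a separate function keeps the encoding step checkable without network access.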
Safer Alternatives
| Tool | Trust | Stars |
|---|---|---|
| openclaw | 84 | 218.2K |
| tensorflow | 72 | 193.9K |
| AutoGPT | 75 | 181.9K |
| n8n | 78 | 177.3K |
| ollama | 74 | 163.0K |
Frequently Asked Questions
What is PKU-SafeRLHF used for?
PKU-SafeRLHF (PKU-Alignment/PKU-SafeRLHF) is a dataset listed in the AI tool category.
Is PKU-SafeRLHF free?
No license is specified, so check the project page for usage terms. PKU-SafeRLHF has 176 GitHub stars.
Is PKU-SafeRLHF safe?
PKU-SafeRLHF has a Nerq Trust Score of 57/100 (D). Use with caution.
What are alternatives to PKU-SafeRLHF?
Top alternatives: openclaw, tensorflow, AutoGPT.
Last updated April 2026. Trust scores based on automated analysis of public data.