Hugging Face Model Score Curation at Endor Labs
Understand how models are factored and scored at Endor Labs, new exploration tab for HuggingFace models
Endor Labs recently extended its Open Source Software (OSS) discovery capability to include Open Source AI models on Hugging Face. Now you can evaluate models based on activity, popularity, security, and quality to help developers select safe and sustainable models.
In this blog we will break down how our scoring system works and how you can best leverage it to select safer models.
What is “Hugging Face”?
Hugging Face is an open-source platform that provides tools and models for natural language processing (NLP) and machine learning (ML). It offers a large repository of pre-trained models which can be used for tasks ranging from text classification and translation to question answering and more. Think of this repository of models as similar to the vast repositories available on GitHub, except in the artificial intelligence (AI) and machine learning context. On top of this model hub, Hugging Face also provides APIs and libraries that make it easier for developers to experiment with and deploy machine learning models.
Hugging Face API Playground
One important feature we rely on to build our scoring system is the set of APIs that Hugging Face has put in place for its models. Hugging Face currently offers a user-friendly interface called the Hub API Playground, which is a great way to learn the Hugging Face APIs.
In particular, the GET /api/models endpoint returns an object with several fields that are valuable when scoring models. An example of an API GET request for the model meta-llama/Meta-Llama-3-8B-Instruct is shown below, along with a few of the fields it returns.
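For illustration, here is a minimal Python sketch of that request using the requests library. This is not the Endor Labs backend code, and the exact fields returned vary by model:

```python
import requests

# Query the public Hub API for a single model's metadata.
model_id = "meta-llama/Meta-Llama-3-8B-Instruct"
resp = requests.get(f"https://huggingface.co/api/models/{model_id}", timeout=30)
resp.raise_for_status()
info = resp.json()

# A few of the returned fields that are useful when scoring.
print(info["id"], info.get("downloads"), info.get("likes"))
print(info.get("tags", [])[:5])
```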
Building our backend using Hugging Face model metadata
We can retrieve Hugging Face metadata for a given model from the following URL:
https://huggingface.co/api/models/{modelName}
This returns a huggingface_hub.ModelInfo object with the fields detailed below; a short usage sketch follows the list.
- id (str) — ID of model.
- author (str, optional) — Author of the model.
- sha (str, optional) — Repo SHA at this particular revision.
- created_at (datetime, optional) — Date of creation of the repo on the Hub. Note that the lowest value is 2022-03-02T23:29:04.000Z, corresponding to the date when Hugging Face began storing creation dates.
- last_modified (datetime, optional) — Date of last commit to the repo.
- private (bool) — Is the repo private.
- disabled (bool, optional) — Is the repo disabled.
- downloads (int) — Number of downloads of the model over the last 30 days.
- downloads_all_time (int) — Cumulative number of downloads of the model since its creation.
- gated (Literal["auto", "manual", False], optional) — Is the repo gated. If so, whether there is manual or automatic approval.
- gguf (Dict, optional) — GGUF information of the model.
- inference (Literal["cold", "frozen", "warm"], optional) — Status of the model on the inference API. Warm models are available for immediate use. Cold models will be loaded on first inference call. Frozen models are not available in Inference API.
- likes (int) — Number of likes of the model.
- library_name (str, optional) — Library associated with the model.
- tags (List[str]) — List of tags of the model. Compared to card_data.tags, contains extra tags computed by the Hub (e.g. supported libraries, model’s arXiv).
- pipeline_tag (str, optional) — Pipeline tag associated with the model.
- mask_token (str, optional) — Mask token used by the model.
- widget_data (Any, optional) — Widget data associated with the model.
- model_index (Dict, optional) — Model index for evaluation.
- config (Dict, optional) — Model configuration.
- transformers_info (TransformersInfo, optional) — Transformers-specific info (auto class, processor, etc.) associated with the model.
- trending_score (int, optional) — Trending score of the model.
- card_data (ModelCardData, optional) — Model Card Metadata as a huggingface_hub.repocard_data.ModelCardData object.
- siblings (List[RepoSibling]) — List of huggingface_hub.hf_api.RepoSibling objects that constitute the model.
- spaces (List[str], optional) — List of spaces using the model.
- safetensors (SafeTensorsInfo, optional) — Model’s safetensors information.
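The same metadata can also be fetched with the official huggingface_hub client library. A minimal sketch follows; field values such as downloads may be absent for some models:

```python
from huggingface_hub import model_info

# Fetch the ModelInfo object for a public model.
info = model_info("BAAI/bge-small-en-v1.5")
print(info.id, info.downloads, info.likes, info.created_at)

# siblings lists every file in the repo -- handy later for spotting
# pickle-based weights versus safetensors.
print([s.rfilename for s in (info.siblings or [])][:10])
```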
This raw information is processed and converted into a series of “score factors” that capture properties of a specific Hugging Face model which contribute, positively or negatively, to the risks of using that model. We organize these score factors into four major categories that capture what we believe are key aspects of risk (a simplified scoring sketch follows the list):
- Security: Indicates the number of security-related issues a model may have. For example, the RepoSibling object, which holds all filenames located in the model repository, gives insight into insecure file formats such as pickle (more information on pickle formats can be found here) and safe file formats such as safetensors, which lower or raise the security score respectively.
- Activity: Indicates the level of development activity for a model observed on Hugging Face. This includes signals such as the number of pull requests or discussion posts in the model’s repo. Models with higher activity scores are more active and presumably better maintained than models with lower activity scores.
- Popularity: Indicates how widely a model is used on Hugging Face by tracking source code management system metrics (for example, the number of likes or downloads for the model) as well as counting how many Spaces use the model. A high popularity score indicates that the model is widely used.
- Code Quality: Indicates how well the model complies with best practices. For instance, the additional metadata that may come with the ModelCardData object contains license information, which can indicate quality. A model with a higher quality score has fewer code issues.
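To make the idea concrete, below is a deliberately simplified, hypothetical sketch of how a few of these raw fields could be turned into score factors and a category score. The factor names, weights, and thresholds are illustrative only and are not Endor Labs' actual scoring formula.

```python
# Hypothetical, simplified scoring sketch -- the factor names and weights
# are illustrative, not the production Endor Labs scoring model.
from huggingface_hub import model_info

def score_factors(repo_id: str) -> dict:
    info = model_info(repo_id)
    files = [s.rfilename for s in (info.siblings or [])]
    return {
        # Security signals: pickle-based weights (.bin/.pkl) vs. safetensors.
        "uses_safetensors": any(f.endswith(".safetensors") for f in files),
        "uses_pickle": any(f.endswith((".bin", ".pkl", ".pickle")) for f in files),
        # Popularity signals: raw usage metrics.
        "downloads_30d": info.downloads or 0,
        "likes": info.likes or 0,
        # Code quality signal: does the model card declare a license?
        "has_license": getattr(info.card_data, "license", None) is not None,
    }

def security_score(factors: dict) -> float:
    """Toy category score on a 0-10 scale derived from the security factors."""
    score = 5.0
    if factors["uses_safetensors"]:
        score += 2.5
    if factors["uses_pickle"]:
        score -= 2.5
    return max(0.0, min(10.0, score))

factors = score_factors("BAAI/bge-small-en-v1.5")
print(factors, security_score(factors))
```

In practice, many more factors feed each category, but the shape of the computation is the same: raw metadata in, normalized score factors out, category scores derived from them.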
Overall, there are several factors which holistically come together to create a comprehensive score card for Hugging Face models. As with our previous work with OSS packages, we will continuously evolve these by adding more score factors and taking advantage of additional information that becomes available through the HF APIs.
Using a Large Language Model (LLM) to extract score factors
As detailed in a previous blog post here, there are several security and operational risks that come with deploying machine learning models and that should also be factored into our model scoring system. However, these important factors are not always available in the metadata returned by the Hugging Face API calls. Instead, much of the valuable information comes from the model READMEs located inside the repositories. These README files are essentially what are known as “Model Cards” and contain valuable metadata about the model. Take, for instance, risks such as unproven models, missing evaluation results, or malicious example code. These are not factors that can be obtained through the metadata coming from Hugging Face’s API as shown above, but they can be easily identified in the model’s README document.
Before we dive deeper into this topic, we will briefly go over what an LLM is. LLMs are a subset of machine learning models specifically designed to understand, generate, and work with human language. Popular and well-known examples of LLMs are ChatGPT, Bard, etc. These LLMs can handle complex language tasks, such as answering questions, summarizing text, translating languages, or generating human-like text given an input from the user.
The problem is that README files on the Hugging Face Hub are still evolving, and no standardized formats or requirements for writing them exist as of now. As a result, some models may not have a README at all, while others have one that is missing the information we are looking for. Take the following two models with existing README documents for comparison. For BAAI/bge-small-en-v1.5, the document is quite extensive, revealing much about the training and evaluation process. This builds a user’s trust and understanding of the model, increasing the model’s overall score. On the other hand, the README for 2Noise/ChatTTS is much more bare-bones. The lack of information and insight into how the model was built decreases the model’s quality score as well as other factors that rely on the README.
With our own eyes, we can easily read a model README and judge whether it contains sufficient information. Things change quickly, however, when trying to use code to parse the unpredictable writing styles and formats found across the thousands upon thousands of model READMEs available on the Hugging Face Hub.
Consequently, we decided to take advantage of LLMs themselves to do the parsing for us! Instead of trying (and failing) to manually parse all these very diverse README files, we instruct an LLM to extract key pieces of information and use that information when computing our scores.
Experimentation:
To make sure we were not taking a shot in the dark, we validated the results parsed by an LLM. The initial experiment consisted of a Python program that retrieved the README content of 100 models chosen from Hugging Face. The program then passed the extracted content along with a predefined prompt to an LLM, and the response was recorded and manually verified to determine whether the LLM had answered accurately. For each model there were 7 different prompts, each corresponding to a specific score factor we wanted to curate. The results were satisfactory: the average rate of correct responses across the 7 prompts was 90.625%.
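A rough sketch of what such a validation loop could look like is shown below. The post does not specify which LLM or prompts were used, so the OpenAI client, the model name, and the prompt text here are placeholders for illustration only.

```python
# Rough sketch of the validation loop -- not the actual experiment code.
# The OpenAI client and prompt are stand-ins; any LLM provider would do.
from huggingface_hub import hf_hub_download
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment
MODELS = ["BAAI/bge-small-en-v1.5", "2Noise/ChatTTS"]  # the experiment used 100 models
PROMPT = ("Does the following Hugging Face model card include evaluation "
          "results? Answer YES or NO.\n\n{readme}")

results = {}
for repo_id in MODELS:
    # Download the model card (README.md) from the Hub.
    readme_path = hf_hub_download(repo_id=repo_id, filename="README.md")
    with open(readme_path, encoding="utf-8") as f:
        readme = f.read()[:20000]  # naive truncation to keep the prompt bounded
    reply = client.chat.completions.create(
        model="gpt-4o-mini",  # placeholder model name
        messages=[{"role": "user", "content": PROMPT.format(readme=readme)}],
    )
    results[repo_id] = reply.choices[0].message.content

# Responses were then checked by hand against the actual README contents.
print(results)
```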
Limitations:
Undoubtedly, using an LLM to analyze and parse the READMEs comes with limitations. The first and most obvious is that the LLM does not respond with 100 percent accuracy.
Another limitation of using an LLM is that, just like human-written text, its responses can be unpredictable and difficult to parse programmatically. As a result, we worked through several prompt iterations until we ultimately arrived at a reliable, parseable response.
Mitigations:
The first issue we mitigated is that the LLM can return an unreliable response, which turned parsing into a guessing game. Even a requirement such as “respond with a simple YES or NO” could cause it to add varying filler words at times, and this led to incomplete scores being built in our database. As a result, we gave the LLM a very strict prompt requiring it to respond with a complete JSON object, which makes parsing much easier. With these efforts, parsing was no longer a problem: we simply access the fields of the unmarshalled JSON object to obtain our score factors, such as whether example code is present, whether a base model exists, and so on.
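Below is a simplified illustration of a JSON-only prompt and the corresponding parsing step; the JSON keys are hypothetical stand-ins for the actual score factors.

```python
# Simplified illustration of the JSON-only prompt and parsing step.
# The keys below are hypothetical stand-ins for the real score factors.
import json

STRICT_PROMPT = """You are analyzing a Hugging Face model card.
Respond with ONLY a JSON object, no prose, in exactly this shape:
{{"has_example_code": true|false, "has_base_model": true|false, "has_eval_results": true|false}}

Model card:
{readme}
"""
# The prompt would be sent as STRICT_PROMPT.format(readme=model_card_text).

def parse_llm_reply(reply: str) -> dict:
    """Parse the LLM reply, tolerating stray text around the JSON object."""
    start, end = reply.find("{"), reply.rfind("}")
    if start == -1 or end == -1:
        raise ValueError("no JSON object found in LLM reply")
    return json.loads(reply[start:end + 1])

# Example with a well-formed reply.
factors = parse_llm_reply(
    '{"has_example_code": true, "has_base_model": false, "has_eval_results": true}'
)
print(factors["has_example_code"], factors["has_base_model"])
```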
When it comes to incorrect responses, we address them with careful prompt design to minimize false positives. For example, when asking a question like “does this model’s README contain performance results?”, a well-designed prompt makes it less likely for the LLM to hallucinate fictional performance results. It is, however, more likely to produce false negatives (i.e. the README has performance results but the LLM misses them). Nonetheless, we carefully log all LLM interactions so we can review and evaluate the quality of the parsing, make necessary changes, and address errors.
Examples in our UI
Model discovery for many of the existing models on Hugging Face is now available in our UI.
Additionally, scores are visible for the categories and factors we have discussed above. Take advantage of these resources to start safely building and deploying ML models in your code today!
In this post, we described how to systematically mine metadata from Hugging Face to increase visibility into the vast number of models available. While our results are carefully vetted, there is always an underlying problem of trust: most of the model metadata available on Hugging Face is essentially self-reported. When a model claims that it was trained on specific datasets or has specific performance results, there is no easy way to verify this information, because at its core the model is just a large amount of binary data. Validating performance would require deploying the model and running tests, and there is no practical way to confirm that a model was trained on a given dataset just by looking at its weights.
Ultimately, there is a larger story around AI-SBOMs and building a trusted inventory of model capabilities, but that is a blog for another day.
AI model discovery is available now under DroidGPT in the free trial. Sign up now and give it a spin!