CVE-2026-26019 PUBLISHED

@langchain/community affected by SSRF Bypass in RecursiveUrlLoader via insufficient URL origin validation

Assigner: GitHub_M
Reserved: 09.02.2026 Published: 11.02.2026 Updated: 11.02.2026

LangChain is a framework for building LLM-powered applications. The RecursiveUrlLoader class in @langchain/community is a web crawler that recursively follows links from a starting URL. Its preventOutside option (enabled by default) is intended to restrict crawling to the same site as the base URL. Prior to 1.1.14, the implementation compared URLs with String.startsWith(), which is a plain string prefix check and does not perform semantic URL validation. An attacker who controls content on a crawled page could include links to domains that share a string prefix with the target (for example, https://example.com.attacker.test when the base URL is https://example.com), causing the crawler to follow links to attacker-controlled or internal infrastructure. Additionally, the crawler performed no validation against private or reserved IP addresses. A crawled page could include links targeting cloud metadata services, localhost, or RFC 1918 addresses, and the crawler would fetch them without restriction. This vulnerability is fixed in 1.1.14.
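
The prefix-matching weakness described above can be illustrated with a minimal TypeScript sketch. The function names (isSameSiteNaive, isSameSiteStrict, isPrivateOrReservedIPv4, isBlockedHost) and the example hostnames are hypothetical and chosen for illustration; this is not the code shipped in @langchain/community, nor necessarily how 1.1.14 implements the fix. It only contrasts a string prefix check with an origin comparison based on the standard URL class, plus a simple hostname-level filter for loopback, link-local, and RFC 1918 addresses.

```typescript
// Hypothetical helpers for illustration only; not the library's actual code.

// Prefix comparison, resembling the flawed approach: no URL parsing at all.
function isSameSiteNaive(baseUrl: string, link: string): boolean {
  return link.startsWith(baseUrl);
}

// Parse both URLs and compare origins (scheme + host + port).
function isSameSiteStrict(baseUrl: string, link: string): boolean {
  try {
    return new URL(baseUrl).origin === new URL(link).origin;
  } catch {
    return false; // reject anything that does not parse as an absolute URL
  }
}

// Assumed, non-exhaustive list of IPv4 ranges a crawler should not reach.
function isPrivateOrReservedIPv4(host: string): boolean {
  const m = /^(\d{1,3})\.(\d{1,3})\.(\d{1,3})\.(\d{1,3})$/.exec(host);
  if (!m) return false;
  const a = Number(m[1]);
  const b = Number(m[2]);
  return (
    a === 0 ||                          // "this network"
    a === 10 ||                         // RFC 1918
    a === 127 ||                        // loopback
    (a === 169 && b === 254) ||         // link-local, incl. cloud metadata
    (a === 172 && b >= 16 && b <= 31) || // RFC 1918
    (a === 192 && b === 168)            // RFC 1918
  );
}

// Hostname-level filter; unparsable input is treated as blocked.
function isBlockedHost(link: string): boolean {
  let host: string;
  try {
    host = new URL(link).hostname;
  } catch {
    return true;
  }
  return host === "localhost" || host === "[::1]" || isPrivateOrReservedIPv4(host);
}

const base = "https://example.com";
const lookalike = "https://example.com.attacker.test/page";

const naiveResult = isSameSiteNaive(base, lookalike);   // prefix matches
const strictResult = isSameSiteStrict(base, lookalike); // origins differ
const metadataBlocked = isBlockedHost("http://169.254.169.254/latest/meta-data/");
```

A hostname-level filter like this does not cover names that resolve to internal addresses via DNS, so a complete defence would presumably also need to check the resolved address before connecting.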

Metrics

CVSS Vector: CVSS:3.1/AV:N/AC:L/PR:L/UI:R/S:C/C:L/I:N/A:N
CVSS Score: 4.1
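
As a cross-check, the score can be reproduced from the vector with the CVSS v3.1 base-score equations. The sketch below hard-codes the metric weights for this vector only (it is not a general-purpose calculator), and the variable names are illustrative.

```typescript
// Metric weights from the CVSS v3.1 specification for this specific vector.
const attackVector = 0.85;       // AV:N
const attackComplexity = 0.77;   // AC:L
const privilegesRequired = 0.68; // PR:L when Scope is Changed
const userInteraction = 0.62;    // UI:R
const confImpact = 0.22;         // C:L
const integImpact = 0;           // I:N
const availImpact = 0;           // A:N

// Roundup as defined in the v3.1 specification: smallest value with one
// decimal place that is >= the input, computed on integers to limit
// floating-point error.
function roundup(x: number): number {
  const n = Math.round(x * 100000);
  return n % 10000 === 0 ? n / 100000 : (Math.floor(n / 10000) + 1) / 10;
}

const iss = 1 - (1 - confImpact) * (1 - integImpact) * (1 - availImpact);

// Impact formula for Scope: Changed (S:C).
const impact = 7.52 * (iss - 0.029) - 3.25 * Math.pow(iss - 0.02, 15);

const exploitability =
  8.22 * attackVector * attackComplexity * privilegesRequired * userInteraction;

// For Scope: Changed the sum is scaled by 1.08 and capped at 10.
const baseScore =
  impact <= 0 ? 0 : roundup(Math.min(1.08 * (impact + exploitability), 10));
```

With these weights the impact sub-score is roughly 1.44 and the exploitability sub-score roughly 2.27, which after scaling and rounding up gives the listed base score of 4.1.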

Product Status

Vendor langchain-ai
Product langchainjs
Versions
  • Version < 1.1.14 is affected

Problem Types

  • CWE-918: Server-Side Request Forgery (SSRF)