XML Injection (aka Blind XPath Injection)

Description

The product does not properly neutralize special elements that are used in XML, allowing attackers to modify the syntax, content, or commands of the XML before it is processed by an end system.

Extended Description

Within XML, special elements could include reserved words or characters such as "<", ">", """, and "&", which could then be used to add new data or modify XML syntax.

Common Consequences 1

Scope: ConfidentialityIntegrityAvailability

Impact: Execute Unauthorized Code or CommandsRead Application DataModify Application Data

Detection Methods 1

Automated Static AnalysisHigh

Automated static analysis, commonly referred to as Static Application Security Testing (SAST), can find some instances of this weakness by analyzing source code (or binary/compiled code) without having to execute it. Typically, this is done by building a model of data flow and control flow, then searching for potentially-vulnerable patterns that connect "sources" (origins of input) with "sinks" (destinations where the data interacts with external components, a lower layer such as the OS, etc.)

Potential Mitigations 1

Phase: Implementation

Strategy: Input Validation

Assume all input is malicious. Use an "accept known good" input validation strategy, i.e., use a list of acceptable inputs that strictly conform to specifications. Reject any input that does not strictly conform to specifications, or transform it into something that does. When performing input validation, consider all potentially relevant properties, including length, type of input, the full range of acceptable values, missing or extra inputs, syntax, consistency across related fields, and conformance to business rules. As an example of business rule logic, "boat" may be syntactically valid because it only contains alphanumeric characters, but it is not valid if the input is only expected to contain colors such as "red" or "blue." Do not rely exclusively on looking for malicious or malformed inputs. This is likely to miss at least one undesirable input, especially if the code's environment changes. This can give attackers enough room to bypass the intended validation. However, denylists can be useful for detecting potential attacks or determining which inputs are so malformed that they should be rejected outright.

References 2

Blind XPath Injection

Amit Klein

19-05-2004

https://dl.packetstormsecurity.net/papers/bypass/Blind_XPath_Injection_20040518.pdf(2023-04-07)

ID: REF-882

The Art of Software Security Assessment

Mark Dowd, John McDonald, and Justin Schuh

Addison Wesley

2006

ID: REF-62

Applicable Platforms

Languages:

Not Language-Specific : Undetermined

Modes of Introduction

Implementation

Related Attack Patterns

Related Weaknesses

ChildOf:

Improper Neutralization of Special Elements in Output Used by a Downstream Component ('Injection') (CWE-74)

ChildOf:

Improper Neutralization of Special Elements in Output Used by a Downstream Component ('Injection') (CWE-74)

Taxonomy Mapping

PLOVER
OWASP Top Ten 2007
OWASP Top Ten 2004
WASC
Software Fault Patterns

Notes

MaintenanceThe description for this entry is generally applicable to XML, but the name includes "blind XPath injection" which is more closely associated with Improper Neutralization of Data within XPath Expressions ('XPath Injection'). Therefore this entry might need to be deprecated or converted to a general category - although injection into raw XML is not covered by Improper Neutralization of Data within XPath Expressions ('XPath Injection') or Improper Neutralization of Data within XQuery Expressions ('XQuery Injection').

TheoreticalIn vulnerability theory terms, this is a representation-specific case of a Data/Directive Boundary Error.

Research GapUnder-reported. This is likely found regularly by third party code auditors, but there are very few publicly reported examples.