pgsql: Add support for parsing of large XML data (>= 10MB)

From: Michael Paquier <michael(at)paquier(dot)xyz>
To: pgsql-committers(at)lists(dot)postgresql(dot)org
Subject: pgsql: Add support for parsing of large XML data (>= 10MB)
Date: 2024-01-17 05:05:21
Message-ID: E1rPy77-001nhN-0x@gemulon.postgresql.org
Views: Raw Message | Whole Thread | Download mbox | Resend email
Thread:
Lists: pgsql-committers

Add support for parsing of large XML data (>= 10MB)

This commit adds XML_PARSE_HUGE to the libxml2 functions used in core
for the parsing of XML objects, raising up the original limit of 10MB
supported by libxml2.

In most code paths of upstream, XML_MAX_TEXT_LENGTH (10^7) is the
historical limit that gets upgraded to XML_MAX_HUGE_LENGTH (10^9) once
XML_PARSE_HUGE is given to the parser calls. These are still limited by
any palloc() calls for text, up to 1GB.

This offers the possibility to handle within the backend XML objects
larger than 10MB in general, with also a higher depth limit. This
change affects the contrib module xml2, the xml data type and SQL/XML.

Author: Dmitry Koval
Reviewed-by: Tom Lane, Michael Paquier
Discussion: https://postgr.es/m/18274-98d16bc03520665f@postgresql.org

Branch
------
master

Details
-------
https://git.postgresql.org/pg/commitdiff/2197d06224a14dba544e05e3df1efcca83bdd9ef

Modified Files
--------------
contrib/xml2/xpath.c | 4 ++--
contrib/xml2/xslt_proc.c | 4 ++--
src/backend/utils/adt/xml.c | 38 +++++++++++++++++++++++++++++---------
3 files changed, 33 insertions(+), 13 deletions(-)

Browse pgsql-committers by date

  From Date Subject
Next Message Michael Paquier 2024-01-17 06:55:26 pgsql: Remove some comments related to pqPipelineSync() and PQsendPipel
Previous Message Michael Paquier 2024-01-16 23:55:06 pgsql: xml2: Replace deprecated routines with recommended ones