Anton <antonin.houska@gmail.com> wrote:
> it was actually the focal point of my considerations: whether to
> store plain text or 'something else'.

Given that there were similar issues for other hierarchical data
types, perhaps we need something similar to tsvector, but for
hierarchical data. The extra layer of abstraction might not cost
much when used for XML compared to the possible benefit with other
data. It seems likely to be a very nice fit with GiST indexes.

So under this idea, you would always have the text (or maybe byte
array?) version of the XML, and you could "shred" it to a separate
column for fast searches.
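
To make the idea a bit more concrete, here is a rough sketch in Python
of what I mean by a derived, searchable form. It is only an
illustration, not a proposed implementation: the function name shred()
and the sorted (path, value) layout are made up for this example, and
nothing like them exists in PostgreSQL today. The original text stays
authoritative; the flattened pairs are what a separate column (and
perhaps an index) would hold.

```python
import xml.etree.ElementTree as ET


def shred(xml_text):
    """Flatten an XML document into sorted (path, value) pairs.

    Hypothetical helper for illustration only: the original text is
    kept elsewhere, and this derived form is what would be searched.
    """
    root = ET.fromstring(xml_text)
    pairs = []

    def walk(elem, prefix):
        path = prefix + "/" + elem.tag
        # Record non-empty element text under the element's path.
        text = (elem.text or "").strip()
        if text:
            pairs.append((path, text))
        # Record attributes under path/@name.
        for name, value in elem.attrib.items():
            pairs.append((path + "/@" + name, value))
        for child in elem:
            walk(child, path)

    walk(root, "")
    return sorted(pairs)


doc = '<book id="7"><title>GiST</title><author><name>Ann</name></author></book>'
shredded = shred(doc)
```

The same flattening would presumably work for other hierarchical
types, since only the parsing step is XML-specific.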
-Kevin