ArHeX: A Framework for Approximate Retrieval in Heterogeneous XML Document Collections