probabilistic data management; query; access; word search; information extraction; XML documents