Detail publikace

Isomorphic mapping of DOM trees for Cluster-Based Page Segmentation

Originální název

Isomorphic mapping of DOM trees for Cluster-Based Page Segmentation

Anglický název

Isomorphic mapping of DOM trees for Cluster-Based Page Segmentation

Jazyk

en

Originální abstrakt

In our previous work we have designed a method for fast and precise Web page segmentation. In this paper we propose a complementary algorithm and data structures that extend the original design. The extension is focused on isomorphic mapping between two DOM trees. Our main objective is to improve robustness of our original solution. We successfully design and implement a solution that is more robust while keeping the efficiency of the original simple one. To prove qualities of our new design we also offer an experimental evaluation of the new implementation.

Anglický abstrakt

In our previous work we have designed a method for fast and precise Web page segmentation. In this paper we propose a complementary algorithm and data structures that extend the original design. The extension is focused on isomorphic mapping between two DOM trees. Our main objective is to improve robustness of our original solution. We successfully design and implement a solution that is more robust while keeping the efficiency of the original simple one. To prove qualities of our new design we also offer an experimental evaluation of the new implementation.

BibTex


@inproceedings{BUT103543,
  author="Jan {Zelený} and Radek {Burget}",
  title="Isomorphic mapping of DOM trees for Cluster-Based Page Segmentation",
  annote="In our previous work we have designed a method for fast and precise Web page
segmentation. In this paper we propose a complementary algorithm and data
structures that extend the original design. The extension is focused on
isomorphic mapping between two DOM trees. Our main objective is to improve
robustness of our original solution. We successfully design and implement
a solution that is more robust while keeping the efficiency of the original
simple one. To prove qualities of our new design we also offer an experimental
evaluation of the new implementation.",
  address="The University of Technology Košice",
  booktitle="Proceedings of the Twelfth International Conference on Informatics INFORMATICS'2013",
  chapter="103543",
  edition="NEUVEDEN",
  howpublished="print",
  institution="The University of Technology Košice",
  year="2013",
  month="november",
  pages="256--261",
  publisher="The University of Technology Košice",
  type="conference paper"
}