1 code implementation • 19 Apr 2024 • Wenhao Huang, Chenghao Peng, Zhixu Li, Jiaqing Liang, Yanghua Xiao, Liqian Wen, Zulong Chen
We propose AutoCrawler, a two-stage framework that leverages the hierarchical structure of HTML for progressive understanding.