Prasad, Jyotika and Paepcke, Andreas (2008) CoreEx: Content Extraction from Online News Articles. Technical Report. Stanford InfoLab.