bplist00_WebMainResource_WebSubframeArchives ^WebResourceURL_WebResourceTextEncodingName_WebResourceMIMEType_WebResourceData_WebResourceFrameName_http://www.emeraldinsight.com/Insight/viewPDF.jsp?contentType=Article&Filename=html/Output/Published/EmeraldFullTextArticle/Pdf/2640330404.pdfUUTF-8Ytext/htmlO* Emerald journal article - PDF P_WebSubresources _{http://www.emeraldinsight.com/Insight/ViewContentServlet?Filename=/published/emeraldfulltextarticle/pdf/2640330404_ref.htmlO/ Emerald FullText Article : Structure-preserving and query-biased document summarisation for web searching

References

Alam, H., Kumar, A., Nakamura, M., Rahman, F., Tarnikova, Y., Wilcox, C. (2003), “Structured and unstructured document summarisation: design of a commercial summariser using lexical chains”, Proceedings of the Seventh International Conference on Document Analysis and Recognition, Edinburgh, pp. 1147-1152, .

Amini, M.R., Tombros, A., Usunier, N., Lalmas, M. (2007), "Learning based summarisation of XML documents", Journal of Information Retrieval, Vol. 10 No.3, pp.233-55.

Baeza-Yates, R., Ribeiro-Neto, B. (1999), Modern Information Retrieval, Addison-Wesley, New York, NY, .

Barzilay, R., Elhadad, M. (1999), "Using lexical chains for text summarisation", in Mani, I., Maybury, M.T. (Eds),Advances in Automatic Text Summarisation, MIT Press, Cambridge, MA, pp.111-21.

Berry, M.W., Browne, M. (1999), Understanding Search Engines: Mathematical Modeling and Text Retrieval, SIAM, Philadelphia, PA, .

Broder, A., Henzinger, M. (2002), "Algorithmic aspects of information retrieval on the web", in Abello, J., Pardalos, P.M., Resende, M.G.C. (Eds),Handbook of Massive Data Sets, Kluwer Academic, Dordrecht, .

Buyukkokten, O., Garcia-Molina, H., Paepcke, A. (2001), “Accordion summarisation for end-game browsing on PDAs and cellular phones”, Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, Seattle, WA, pp. 213-220, .

Can, F., Kocberber, S., Balcik, E., Kaynak, C., Ocalan, H.C., Vursavas, O.M. (2008), "Information retrieval on Turkish texts", Journal of the American Society for Information Science and Technology, Vol. 59 No.3, pp.407-21.

Chaudhuri, B.B. (2006), Digital Document Processing: Major Directions and Recent Advances, Springer, London, .

Chen, Y., Ma, W.Y., Zhang, H.J. (2003), “Detecting web page structure for adaptive viewing on small form factor devices”, Proceedings of the 12th International World Wide Web Conference, Budapest, pp. 225-233, .

(2006), in Chowdhury, G.G. (Eds),Introduction to Modern Information Retrieval, Facet, London, .

Chung, C.Y., Gertz, M., Sundarsan, N. (2002), “Reverse engineering for web data: from visual to semantic structures”, Proceedings of the 18th International Conference on Data Engineering, San Jose, CA, pp. 53-63, .

Feng, J., Haffner, P., Gilbert, M. (2005), “A learning approach to discovering web page semantic structures”, Proceedings of the Eighth International Conference on Document Analysis and Recognition, Seoul, pp. 1055-1059, .

Grossman, D.A., Frieder, O. (2004), Information Retrieval: Algorithms and Heuristics, Springer, Dordrecht, .

Guo, Y., Stylios, G. (2005), "An intelligent summarisation system based on cognitive psychology", Information Sciences, Vol. 174 No.1-2, pp.1-36.

Gupta, S., Kaiser, G.E., Grimm, P., Chiang, M.F., Starren, J. (2005), "Automating content extraction of HTML documents", World Wide Web, Vol. 8 No.2, pp.179-224.

Hobson, S.P., Dorr, B.J., Monz, C., Schwartz, R. (2007), "Task-based evaluation of text summarisation using relevance prediction", Information Processing and Management, Vol. 43 No.6, pp.1482-99.

Ingwersen, P., Järvelin, K. (2005), The Turn: Integration of Information Seeking and Retrieval in Context, Springer, Dordrecht, .

Jackson, P., Moulinier, I. (2007), Natural Language Processing for Online Applications: Text Retrieval, Extraction and Categorization, John Benjamins, Amsterdam, .

Jansen, B.J., Spink, A. (2005), "An analysis of web searching by European AlltheWeb.com users", Information Processing and Management, Vol. 41 No.2, pp.361-81.

Kowalski, G.J., Maybury, M.T. (2002), Information Storage and Retrieval Systems: Theory and Implementation, Kluwer Academic, Boston, MA, .

Liang, S.F., Devlin, S., Tait, J. (2007), "Investigating sentence weighting components for automatic summarisation", Information Processing and Management, Vol. 43 No.1, pp.146-53.

Mani, I. (2001), Automatic Summarisation, John Benjamins, Amsterdam, .

(1999), in Mani, I., Maybury, M.T. (Eds),Advances in Automatic Text Summarisation, MIT Press, Cambridge, MA, .

Marcu, D. (1999), "Discourse trees are good indicators of importance in text", in Mani, I., Maybury, M.T. (Eds),Advances in Automatic Text Summarisation, MIT Press, Cambridge, MA, pp.123-36.

Marcu, D. (2000), The Theory and Practice of Discourse Parsing and Summarisation, MIT Press, Cambridge, MA, .

Markey, K. (2007), "Twenty-five years of end-user searching, part 1: research findings", Journal of the American Society for Information Science and Technology, Vol. 58 No.8, pp.1071-81.

Maynard, D., Bontcheva, K., Saggion, H., Cunningham, H., Hamza, O. (2002), “Using a text engineering framework to build an extendable and portable IE-based summarisation system”, Proceedings of the ACL Workshop on Text Summarisation, Philadelphia, PA, pp. 19-26, .

Moens, M.-F. (2002), Automatic Indexing and Abstracting of Document Texts, Kluwer Academic, Boston, MA, .

Montgomery, D.C. (2001), Design and Analysis of Experiments, John Wiley, New York, NY, .

Mukherjee, S., Yang, G., Tan, W., Ramakrishnan, I.V. (2003), “Automatic discovery of semantic structures in HTML documents”, Proceedings of the Seventh International Conference on Document Analysis and Recognition, Edinburgh, pp. 245-249, .

Niyogi, D., Srihari, N. (1995), “Knowledge-based derivation of document logical structure”, Proceedings of the Third International Conference on Document Analysis and Recognition, Montreal, pp. 472-475, .

Otterbacher, J., Radev, D., Kareem, O. (2006), “News to go: hierarchical text summarisation for mobile devices”, Proceedings of 29th Annual ACM SIGIR Conference on Research and Development in Information Retrieval, Seattle, WA, pp. 589-596, .

Porter, M. (1980), "An algorithm for suffix stripping", Program, Vol. 14 No.3, pp.130-7.

Raggett, D., Le Hors, A., Jacobs, I. (1999), HTML 4.01 specification, available at: www.w3.org/TR/html401/ (accessed 15 February 2008), .

Sparck-Jones, K. (1999), "Automatic summarising: factors and directions", in Mani, I., Maybury, M.T. (Eds),Advances in Automatic Text Summarisation, MIT Press, Cambridge, MA, pp.1-12.

Szlávik, Z., Tombros, A., Lalmas, M. (2006), “Investigating the use of summarisation for interactive XML retrieval”, Proceedings of the 2006 ACM Symposium on Applied Computing, Dijon, pp. 1068-1072, .

Tombros, A., Sanderson, M. (1998), “Advantages of query biased summaries in information retrieval”, Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Salvador, pp. 2-10, .

Vadrevu, S., Gelgi, F., Davulcu, H. (2007), "Information extraction from web pages using presentation regularities and domain knowledge", World Wide Web, Vol. 10 No.2, pp.157-79.

Van Oostendorp, H., De Mul, S. (1996), Cognitive Aspects of Electronic Text Processing, Ablex, Norwood, NJ, .

Varadarajan, R., Hristidis, V. (2005), “Structure-based query-specific document summarisation”, Proceedings of the 14th ACM International Conference on Information and Knowledge Management, Bremen, pp. 231-232, .

White, R.W., Jose, J.M., Ruthven, I. (2003), "A task-oriented study on the influencing effects of query-biased summarisation in web searching", Information Processing and Management, Vol. 39 No.5, pp.707-33.

Xue, Y., Hu, Y., Xin, G., Song, R., Shi, S., Cao, Y., Lin, C.Y., Li, H. (2007), "Web page title extraction and its application", Information Processing and Management, Vol. 43 No.5, pp.1332-47.

Yang, C.C., Wang, F.L. (2008), "Hierarchical summarisation of large documents", Journal of the American Society for Information Science and Technology, Vol. 59 No.6, pp.887-902.

Yang, Y., Zhang, H.J. (2001), “HTML page analysis based on visual cues”, Proceedings of the Sixth International Conference on Document Analysis and Recognition, Seattle, WA, pp. 859-864, .

Yeh, J.Y., Ke, H.R., Yang, W.P., Meng, I.H. (2005), "Text summarisation using a trainable summariser and latent semantic analysis", Information Processing and Management, Vol. 41 No.1, pp.75-95.

ZrightFrame_WebResourceResponse_Dhttp://assets.emeraldinsight.com/css/ewip_emeraldfulltextarticle.cssO bplist00 X$versionT$topY$archiverX$objects_WebResourceResponse_NSKeyedArchiver &,-345MNOPQRSTUVWXYZ[\]^bcU$null  !"#$%V$classR$3R$8S$10S$11R$5R$6R$4R$7R$2R$9R$0R$1 '()$+WNS.base[NS.relative_Dhttp://assets.emeraldinsight.com/css/ewip_emeraldfulltextarticle.css./01X$classesZ$classname12UNSURLXNSObject#AB؟;* 6789CWNS.keysZNS.objects:;<=>?@AB DEFGHIJ:L ZKeep-Alive]Accept-Ranges\Content-TypeVServer^Content-LengthTDateTEtagZConnection]Last-Modified_timeout=15, max=100UbytesXtext/cssVApacheT1471_Mon, 24 Aug 2009 14:18:07 GMT_"ff59-5bf-e2edd40"_Tue, 07 Jul 2009 09:14:53 GMT./_``a2_NSMutableDictionary\NSDictionary./deef2_NSHTTPURLResponse]NSURLResponse)27:PRd  RW`knt} "-;QW`gl gXtext/cssO@media screen { body { margin:0 20px 20px 20px; padding:0 0 0 0; font: x-small Arial, Helvetica, sans-serif; } html>body { font-size: 81.3%; } h1 { font-size: 1.7em; color: #000; /*color:#4A4A4A;*/ border-bottom: 1px solid #000; padding:108px 0 12px 0px; margin-top: 10px; background: #FFF url(emerald_logo.gif) no-repeat left top; } h2, legend { font-size: 1.16em; margin :0 0 16px 0; color: #006600; font-weight:bold; } h3 { font-size: 1em; margin-top:0pt; margin-bottom:0pt; } hr { color: #000; background-color: #000; height: 1px; } .bypass { display: none; } a:link, a:visited { color: blue; } a:hover { color: #000; } abbr, acronym, span.abbr { cursor: help; border-bottom: 1px dashed #000; } ul { margin-top:4px; margin-bottom:4px; } li { list-style:square; list-style-position:inside; } .inline { display:inline; } } @media print { body { margin:0px; padding:0 0 0 0; font: x-small Arial, Helvetica, sans-serif; } html>body { font-size: 81.3%; } h1 { font-size: 1.7em; color: #000; /*color:#4A4A4A;*/ border-bottom: 1px solid #000; padding:0px 0 12px 0px; } h2, legend { font-size: 1.16em; margin :0 0 16px 0; color: #006600; font-weight:bold; } h3 { font-size: 1em; margin-top:0pt; margin-bottom:0pt; } hr { color: #000; background-color: #000; height: 1px; } .inline { display:inline; } } 5@Om=CM{|~5"5-5/585N599?n