I wish to write a script (i.e. computer program) that can go to a webpage, extract the HTML code, and remove all advertisements, graphics and supplementary information - leaving only the key content of that page. This script will need an understanding of the structure of the HTML page and also some artificial intelligence. I will send the URL of the webpage to this script and it should return the text contained in the webpage.
If any expert knows how to write this script or wish to give it a try, send an e-mail to kinlian@gmail.com.
No comments:
Post a Comment
Contoh Makalah Jurnal Skripsi Tesis
PDF Download PDF Search Engine
Art Gallery Artist - Contemporary Abstract Paintings and Graphics
History of Art, Artists & Art Movements
Top 30 Hot Music Downloads
Top Digital Songs
Christian Residential Drug Treatment
Donate Your Car San Francisco
Firm Law Mesothelioma Texas
Ms Exchange Server Hosting
Villa di Piazzano Cortona Italy Hotel
Windows Download Software
Windows Download Center
plastic surgery before and after korean
Fashion N style
Aliving Room Furniture
The Hotels Las Vegas
Acamping Sites
About Hilton Hotels
Women Hair Styles Short
Hair Styles Short Medium
2010 Haircuts Style
Hair Styles Short Hair
Insurance Quotes Online
Free Download Software
Cars Wallpapers
FreeCars Wallpapers
Health Insurance
Android Download
Android Download
Free Cars Wallpaper
Note: Only a member of this blog may post a comment.