Book Description
While Web 2.0 was about data, Web 3.0 is about knowledge and information. Scripting Intelligence: Web 3.0 Information Gathering and Processing offers the reader Ruby scripts for intelligent information management in a Web 3.0 environment—including information extraction from text, using Semantic Web technologies, information gathering (relational database metadata, web scraping, Wikipedia, Freebase), combining information from multiple sources, and strategies for publishing processed information. This book will be a valuable tool for anyone needing to gather, process, and publish web or database information across the modern web environment.
- Text processing recipes, including part-of-speech tagging and automatic summarization
- Gathering, visualizing, and publishing information from the Semantic Web
- Information gathering from traditional sources such as relational databases and web sites
What you'll learn
- Gather and process information within the Web 3.0 environment.
- See the flexibility of scripting with Ruby to gather and process information.
- Extract text from various document formats.
- Work with the RDF data model and SPARQL query language, the foundations of the Semantic Web.
- Use GraphViz for data visualization.
- Extract information from relational databases and web sites.
Who is this book for?
- Anyone needing to gather and display information available in electronic formats
- Programmers needing to tag, summarize, or publish information
- Ruby programmers and computer enthusiasts interested in seeing what Ruby can do with information management and Semantic Web tools
- Academic researchers needing to extract and organize information in a more automated way
About the Author
Mark Watson is the author of 14 books on artificial intelligence, Java, C++, UML, and Linux. He is a consultant who uses Ruby, Java, and Common Lisp. He maintains a web site at http://markwatson.com.
Book Details
- Paperback: 350 pages
- Publisher: Apress; 1st edition (June 30, 2009)
- Language: English
- ISBN-10: 1430223510
- ISBN-13: 978-1430223511
- File Size: 7.6 MiB