[unrev-II] World Wide Web Wrapper Factory

From: Jack Park (jackpark@thinkalong.com)
Date: Mon Oct 01 2001 - 17:30:18 PDT

  • Next message: Pedro M.: "Re: [unrev-II] Arundhati Roy's commentary on Sept 11 atrocity"

    http://db.cis.upenn.edu/W4F/overview.html

    "W4F is a toolkit for the generation of wrappers for Web sources.
    It consists of a retrieval language to identify Web sources, a declarative
    extraction language (HEL: HTML Extraction Language) to express robust
    extraction rules and a mapping interface to export the extracted
    information into some user-defined data-structures.
    To assist the user and make the creation of wrappers rapid and easy, the
    toolkit offers some wysiwyg support via some wizards.
    Together, they permit the fast and semi-automatic generation of ready-to-go
    wrappers.
    The wrappers are generated as Java classes.
    W4F has been successfully used to generate wrappers for database systems
    and software agents, making the content of Web sources easily accessible to
    any kind of application. "

    This one seems useful for web mining projects.

    ------------------------ Yahoo! Groups Sponsor ---------------------~-->
    Pinpoint the right security solution for your company- Learn how to add 128- bit encryption and to authenticate your web site with VeriSign's FREE guide!
    http://us.click.yahoo.com/yQix2C/33_CAA/yigFAA/IHFolB/TM
    ---------------------------------------------------------------------~->

    Community email addresses:
      Post message: unrev-II@onelist.com
      Subscribe: unrev-II-subscribe@onelist.com
      Unsubscribe: unrev-II-unsubscribe@onelist.com
      List owner: unrev-II-owner@onelist.com

    Shortcut URL to this page:
      http://www.onelist.com/community/unrev-II

    Your use of Yahoo! Groups is subject to http://docs.yahoo.com/info/terms/



    This archive was generated by hypermail 2.0.0 : Mon Oct 01 2001 - 17:25:09 PDT