JBootCat - v.0.2 - README Andrew Roberts (14-Aug-2006) dev [at] andy-roberts [dot] net http://www.andy-roberts.net/software/jbootcat/ Overview ======== Marco Baroni et al* released the Bootcat tool-chain as a method of acquiring specialised corpora from the Internet. Bootcat is a combination of Perl scripts and Unix text processing utilities. JBootCat aims to fully implement the Bootcat tool-chain within a user-friendly GUI application. As it is implemented in Java it is extremely portable and can be run on any OS on which the Java Runtime is installed. * BootCat references: M. Baroni and S. Bernardini. 2004. BootCaT: Bootstrapping corpora and terms from the web. Proceedings of LREC 2004, Lisbon: ELDA. 1313-1316. http://sslmit.unibo.it/~baroni/publications/lrec2004/bootcat_lrec_2004.pdf M. Baroni, A. Kilgarriff, J. Pomikalek and P. Rychly. 2006. WebBootCaT: Instant domain-specific corpora to support human translators. Proceedings of EAMT-2006 Poster Session. 247-252. http://corpora.fi.muni.cz/bootcat/publications/webbootcat_eamt2006.pdf Features ======== JBootCat is currently in very early stages and only contains basic functionality. That said, it's still adequate if your goal is to create a new corpus from the Internet from a set of seed terms (in typical Bootcat fashion). Installation ============ JBootCat doesn't come with an installer (yet!). JBootCat is distributed as a Zip file. Download from: http://www.andy-roberts.net/software/jbootcat/releases/0.2/jbootcat-0.2.zip To uncompress the file, there are many utilities. On Windows, a popular utility is WinZip. On most platforms, there are command-line tools, such as 'unzip' that can also be used. It contains the following: ./jbootcat-0.2/README.txt (you're reading this!) ./jbootcat-0.2/JBootCat.jar (Java classes) ./jbootcat-0.2/jbootcat.exe (Windows executable to launch the Java application) ./jbootcat-0.2/lib/googleapi.jar (Library for communicating with Google) ./jbootcat-0.2/lib/jtidy-r8-SNAPSHOT.jar (Library for processing HTML) ./jbootcat-0.2/lib/jTokeniser-1.2.1.jar (Tokenisation library) ./jbootcat-0.2/lib/swing-layout-1.0.jar (Library for building GUI) ./jbootcat-0.2/lib/Wizard.jar (Generic "Wizard" GUI framework) ./jbootcat-0.2/lib/yahoo_search-1.1.0.jar (Library for communicating with Yahoo! Not yet used.) Important note: In order to use jbootcat, you need to have the Java Runtime Environment installed. It requires Java 5.0 or above. To obtain Java (or update to the latest version) goto http://www.java.com and it will automatically detect the version that you need to download and install. Running ======= * Cross-platform At command-line, type: java -jar JBootCat.jar * Microsoft Windows 1. Double-click jbootcat.exe 2. Double-click JBootCat.jar Any of the above methods will cause the GUI to load. Enjoy! Contact ======= If you wish to contact the developer about JBootCat to suggest future features, bugs or anything that you want, please email me at: dev [at] andy-roberts [dot] net * Anti-spam format. Please remove all spaces, and replace '[at]' with the '@' symbol (no quotes), etc.