Category Archives: Tools
Purpose: web interface to CWB
Version: stable: 3.2.43, dev: 3.3.15 [r1848]
Date: 31 Dec 2021 / 23 July 2023
Platform: web-based (also on localhost)
License: open source (GNU GPLv2+)
Price/Availability: free
Programming Language(s): PHP, MySQL, Perl
Key features: SERVER INSTALLATION, MANAGE YOUR OWN CORPORA, WEB INTERFACE, CQP QUERIES
Website: CQPweb project page
Website: CQPWeb SVN Repository
Website: UCREL Lancaster Corpus Server (free access to many corpus resources after registration, including the extended Brown family of corpora)
Website: CQPweb at Beijing Foreign Studies University (large number of publicly accessible corpora; username: test, password: test)
Website: CQPweb Video Tutorials
Website: CQPwebInABox Video Tutorials
Purpose: scripts for indexing and querying parallel corpora with CWB
Version: 3.10
Date: 2019-12-16
Platform: Linux/OSX
License: open source (GNU GPLv3+)
Price/Availability: free
Programming Language(s): Perl
Key features: CWB, CORPUS INDEXING, PARALLEL CORPUS QUERYING
Website: OPUS project page
Website: Uplug Repository
Purpose: Simple web interface for querying (cwb-indexed) parallel corpora.
Version: git-commit: e985236
Date: 21 May 2015
Platform: Linux/OSX
License: open source (GNU GPLv2+)
Price/Availability: free
Programming Language(s): PHP, XSLT
Key features: ONLINE PARALLEL CONCORDANCER, CQP-QUERY SUPPORT, OPTIMIZED FOR MULTILINGUAL CONCORDANCES IN 3+ LANGUAGES
Website: http://parasolcorpus.org (v1)
Website: Bitbucket Repository (v1)
Website: Short Demo Clip (v1)
Purpose: Open-source corpus query system
Version: stable: 3.5.2, dev: 3.5 [r1840]
Date: 24 July 2022 / 17 Mar 2023
Platform: Win (from v3.1)/Linux/OSX
License: open source (GNU GPLv2+)
Price/Availability: free
Programming Language(s): C++, Perl
Key features: CORPUS MANAGEMENT, POWERFUL QUERY LANGUAGE
Website: CWB Project Home
t2t-pipe tutorial
t2t-pipe
Purpose: automatic alignment pipeline for parallel treebanks
Version: source: 1.4
Date: 10 Dec 2013
Platform: Linux
License: open source (GNU GPLv2+)
Price/Availability: free
Programming Language(s): Python
Key features: AUTOMATIC ALIGNMENT PIPELINE, WRAPPERS FOR HUNALIGN, ZHECHEV TREEALIGNER, EXPORT TO TIGERXML AND TMX
Website: t2t-pipe Repository
Website: 4-minute Screencast
Website: Introduced in Killer, M; Sennrich, R; Volk, M (2011)
Purpose: web interface to t2t-pipe
Version: unversioned
Date: Aug 2013
Platform: Linux
License: not specified (contact webmaster(at)langui.ch for source code)
Price/Availability: free
Programming Language(s): PHP
Key features: QUICK TREEBANK GENERATION FROM TWO PARALLEL TEXT FILES
Website: t2t-pipe web interface (created by Matthias Fluor and Michael Amsler)
Website: CLab Lerneinheit (in German, created by Matthias Fluor)
Related posts on langui.ch
AntConc dependencies on Ubuntu/Xubuntu 64-bit
[Last update: 17/03/2014]
If you try to launch AntConc on an Ubuntu-based 64-bit system, you get the following error message (tested with versions 3.2.4u, 3.3.5u, 3.4.0u and 3.4.1u):
./antconc3.2.4u: No such file or directory
or
./AntConc: No such file or directory
The minimal dependencies I had to install in order for AntConc to start successfully on Ubuntu 64-bit were the following libraries (tested on Ubuntu 13.10 and Xubuntu 12.04 LTS):
sudo apt-get install libx11-6:i386 libxss1:i386 libxft2:i386
For TurnKey/Debian systems, see this post; for other Linux distributions, see the ongoing discussion at https://groups.google.com/forum/#!forum/antconc
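The error above usually does not mean that the AntConc file itself is missing. Since the fix is to install i386 libraries, a likely cause is that the binary is a 32-bit executable and the 64-bit system has no 32-bit loader for it. The following shell function is a minimal sketch for checking that assumption. The name `elf_class` is a hypothetical helper, not part of AntConc or Ubuntu. It reads the fifth byte of the ELF header, which is 1 for 32-bit and 2 for 64-bit binaries.

```shell
# elf_class FILE
# Prints 32 or 64 depending on the ELF class of FILE,
# or "unknown" if FILE does not start with the ELF magic number.
elf_class() {
    # The first four bytes of an ELF file are 0x7f 'E' 'L' 'F'.
    magic=$(head -c 4 "$1" | od -An -tx1 | tr -d ' \n')
    if [ "$magic" != "7f454c46" ]; then
        echo unknown
        return
    fi
    # The fifth byte (EI_CLASS) is 1 for 32-bit and 2 for 64-bit.
    class=$(head -c 5 "$1" | tail -c 1 | od -An -tu1 | tr -d ' \n')
    case "$class" in
        1) echo 32 ;;
        2) echo 64 ;;
        *) echo unknown ;;
    esac
}

# Example call (path is an assumption): elf_class ./AntConc
```

If the function prints 32 for the AntConc binary on a 64-bit system, the missing 32-bit libraries are the probable cause. Where the `file` utility is installed, `file ./AntConc` reports the same information.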
Xaira
Purpose: Indexing and analysis of large XML corpora
Version: stable: 1.26
Date: 4 Aug 2010
Platform: Win/Linux/OSX
License: open source (GNU GPLv2+)
Price/Availability: free
Programming Language(s): Perl
Key features: XML-AWARE, INDEXING, CONCORDANCER, DEVELOPED FOR BNC
Website: Xaira Repository (EN)
Website: All about Xaira (OUP – Help page including tutorials)
InterText
Purpose: Parallel text alignment editor.
Version: 1.6.3
Date: 3 Nov 2020
Platform: Win/Linux/OSX
License: freeware (GNU GPLv3+)
Price/Availability: free
Programming Language(s): C++ (GUI: Qt)
Key features: CROSSPLATFORM PARALLEL TEXT EDITOR, OPTIONAL ALIGNER INTEGRATION
Website: InterText (EN)
Website: GitHub Repo
Website: Pavel Vondřička’s Homepage (EN)
Purpose: Server version of parallel text alignment editor.
Version: 2.3 (stable) / 2.3.1 (dev)
Date: 7 Oct 2020 / 21 Sep 2021
Platform: Win/Linux/OSX
License: freeware (GNU GPLv3+)
Price/Availability: free
Programming Language(s): PHP, MySQL, C++ binaries
Key features: SERVER VERSION, PARALLEL TEXT EDITOR, OPTIONAL ALIGNER INTEGRATION
Website: InterText (EN)
Website: GitHub Repo
Website: Pavel Vondřička’s Homepage (EN)
TheSketchEngine
Purpose: Open-source web interface to the corpus management system manatee (bonito fork with support for parallel corpora)
Version: 0.17.2
Date: 1 June 2023
Platform: web-based (also on localhost)
License: open source (GNU GPLv2+)
Price/Availability: free
IMPORTANT NOTICE: the official NoSketchEngine releases of manatee 2.59.X / 2.107.1 are NOT SUPPORTED, but a compatible fork is provided on GitHub (see link below)
Programming Language(s): Python
Key features: SERVER INSTALLATION, SUPPORT FOR PARALLEL CORPORA
Website: KonText
Website: KonText Repository (bonito fork maintained by Czech National Corpus)
Website: KonText compatible fork of manatee / bonito / gdex / crystal-open
Purpose: Open-source corpus management system
Version: 2.223.6-open-5.63.9-open-4.12-2.142 (manatee/bonito/gdex/crystal)
Date: 17 Apr 2023
Platform: web-based (also on localhost)
License: open source (GNU GPLv2+)
Price/Availability: free
Programming Language(s): Python, C++, Perl
Key features: SERVER INSTALLATION, MANAGE YOUR OWN CORPORA
Website: NoSketchEngine (bonito, manatee, gdex, crystal, open-susanne-corpus)
Website: KonText Repository (alternative front end maintained by Czech National Corpus)
Website: SketchEngine (commercial version)
Purpose: Advanced Corpus Management System
Version: stable: 2.36.5-SkE-2.151.6-3.99.3 / beta: 2.36.7-SkE-2.152.1-3.101 (finlib/manatee/bonito)
Date: 4 Aug 2023 (last version check, under constant development)
Platform: web-based
License: commercial
Price/Availability: €78.-/year (academic single-user license, own corpus quota: 1 million words); 30-day free trial
Programming Language(s): various
Key features: WORD SKETCHES, SOPHISTICATED COLLOCATION MEASURES, THESAURUS, PRELOADED BILLION WORD CORPORA FOR MANY LANGUAGES, EASIEST WAY TO CREATE YOUR OWN SYNTACTICALLY ANNOTATED CORPORA
Website: SketchEngine (stable)
Website: SketchEngine (beta)
Website: NoSketchEngine (open source version – reduced functionality)
AntPConc
Purpose: Concordancing program for parallel corpora
Version: 1.2.1
Date: 20 Dec 2017
Platform: Win/OSX/Linux
License: freeware (not specified)
Price/Availability: free
Programming Language(s): Perl
Key features: PARALLEL CONCORDANCER, KWIC FOR SOURCE LANGUAGE – SENTENCE HIGHLIGHTING FOR TARGET LANGUAGE, ADVANCED SORTING
Website: Download Page (EN)
Website: Laurence Anthony’s Homepage (EN)
AntConc
Purpose: Concordancing program for monolingual corpora.
Version: Win/Mac/Linux: 4.2.0
Date: 03 Jan 2023
Platform: Win/Linux/macOS
License: freeware (AntConc-License)
Price/Availability: free
Programming Language(s): Perl
Key features: KWIC, ADVANCED SORTING, KEY(WORD) LISTS, COLLOCATION MEASURES, VISUALISING DISTRIBUTION ACROSS FILES, HIDE TAGS, CUSTOM TOKEN/WORD DEFINITIONS
Website: AntConc Homepage (EN)
Website: Support Forum (extremely helpful and active community)
Website: Laurence Anthony’s Homepage (EN)
TextStat
Purpose: Simple concordancing program for monolingual corpora
Version: Binary: 2.9c, Source: 3.0 (beta)
Date: 20 Feb 2014 / 2 Sep 2015
Platform: Win/Linux/OSX
License: open source (MIT)
Price/Availability: free
Programming Language(s): Python (GUI: Tk)
Key features: KWIC, SORTING, WORD LISTS, ADD ONLINE WEBPAGES TO CORPUS
Website: TextSTAT (EN)
Website: TextSTAT (DE)