Purpose: Concordancing program for monolingual corpora.
Version: classic: 3.2.4 / modern: 3.4.4 / dev: 3.5.0
Date: 10 Nov 2011 / 26 Jun 2015 / 14 Jul 2015
Platform: classic/modern: Win/Linux/OSX
License: freeware (license not specified)
Price/Availability: free
Programming Language(s): Perl
Key features: KWIC, ADVANCED SORTING, KEY(WORD) LISTS, COLLOCATION MEASURES, VISUALISING DISTRIBUTION ACROSS FILES, HIDE TAGS, CUSTOM TOKEN/WORD DEFINITIONS
Website: AntConc Homepage (EN)
Website: Support Forum (extremely helpful and active community)
Website: Laurence Anthony’s Homepage (EN)
Purpose: Concordancing program for parallel corpora.
Version: 1.1.0
Date: 2 Apr 2015
Platform: Win/OSX
License: freeware (license not specified)
Price/Availability: free
Programming Language(s): Perl
Key features: PARALLEL CONCORDANCER, KWIC FOR SOURCE LANGUAGE – SENTENCE HIGHLIGHTING FOR TARGET LANGUAGE, ADVANCED SORTING
Website: Download Page (EN)
Website: Laurence Anthony’s Homepage (EN)
Purpose: Web-based framework for distribution and analysis of single and parallel corpora.
Version: 1.0
Date: 25 Sep 2012
Platform: Win/Linux/OSX
License: freeware / open source / available on request at http://www.laurenceanthony.net (license not specified)
Price/Availability: free
Programming Language(s): Perl, PHP
Key features: MINIMAL PRE-PROCESSING REQUIRED, SERVER INSTALLATION, (PARALLEL) KWIC, SORTING, HIGHLIGHTING OF KEYWORD IN PARALLEL CORPUS, ACTIVE SENTENCE HIGHLIGHTING, WEBPARANEWS, ANTWEBCONC
Website: Laurence Anthony’s Homepage (EN)
Website: AntWCF EN-DE (sample implementation 2014)
Website: WebParaNews EN-JP (sample implementation 2012)
Website: Step-by-step installation guide on Xubuntu 12.04 LTS
Purpose: Software tool for annotating text information in TEI format.
Version: stable: 3.8
Date: 2 Aug 2010
Platform: Win/Linux/OSX
License: open source (GNU GPLv3)
Price/Availability: free
Programming Language(s): Java
Key features: TEI XML ANNOTATION GUI, OFFLINE USE, ONLINE COLLABORATION
Website: BACKBONE Annotator Download
Website: BACKBONE Annotator – Latest Source Code
Website: BACKBONE Project Page
Website: BACKBONE Corpus Search
Purpose: Web interface to CWB.
Version: stable: 3.0.16 / dev: 3.2.1 [r816]
Date: 26 Dec 2013 / 5 Apr 2016
Platform: web-based (also on localhost)
License: open source (GNU GPLv2+)
Price/Availability: free
Programming Language(s): PHP, MySQL, Perl
Key features: SERVER INSTALLATION, MANAGE YOUR OWN CORPORA, WEB INTERFACE, CQP QUERIES
Website: CQPweb project page
Website: CQPWeb SVN Repository
Website: UCREL Lancaster Corpus Server (free access to a lot of corpus resources after registration, including the extended Brown-family of corpora)
Website: CQPweb at Beijing Foreign Studies University – Large Number of publicly accessible corpora (username: test, password: test)
Website: CQPweb Video Tutorials
Website: CQPwebInABox Video Tutorials
Purpose: Parallel text alignment editor.
Version: 1.3.3
Date: 18 Feb 2016
Platform: Win/Linux/OSX
License: freeware (GNU GPLv3+)
Price/Availability: free
Programming Language(s): C++ (GUI: Qt)
Key features: CROSSPLATFORM PARALLEL TEXT EDITOR, OPTIONAL ALIGNER INTEGRATION
Website: InterText (EN)
Website: GitHub Repo
Website: Pavel Vondřička’s Homepage (EN)
Purpose: Server version of parallel text alignment editor.
Version: 2.2 / 2.2.1 (dev)
Date: 14 Nov 2014 / 23 Mar 2016
Platform: Win/Linux/OSX
License: freeware (GNU GPLv3+)
Price/Availability: free
Programming Language(s): PHP, MySQL, C++ binaries
Key features: SERVER VERSION, PARALLEL TEXT EDITOR, OPTIONAL ALIGNER INTEGRATION
Website: InterText (EN)
Website: GitHub Repo
Website: Pavel Vondřička’s Homepage (EN)
Purpose: Open-source web interface to the corpus management system manatee (bonito fork with support for parallel corpora).
Version: 0.7.x
Date: 10 Apr 2016
Platform: web-based (also on localhost)
License: open source (GNU GPLv2+)
Price/Availability: free
Important notice: the official NoSketchEngine releases of manatee (2.59.X / 2.107.1) are NOT SUPPORTED, but a compatible fork is provided on GitHub (see link below).
Programming Language(s): Python
Key features: SERVER INSTALLATION, SUPPORT FOR PARALLEL CORPORA
Website: KonText
Website: KonText Repository (bonito fork maintained by Czech National Corpus)
Website: NEW (May 2015): KonText-compatible fork of manatee
Purpose: Graphical interface to hunalign and alignment editor.
Version: 4.1
Date: 11 Feb 2015
Platform: Win/Linux/OSX
License: open source (GNU GPLv3+)
Price/Availability: free
Programming Language(s): Perl
Key features: EDIT AND CORRECT AUTOMATIC SENTENCE ALIGNMENTS
Website: LF Aligner project page
Purpose: Open-source corpus management system.
Version: 2.33.1-open-2.130.6-open-3.80.5 (finlib/manatee/bonito)
Date: 12 Nov 2015
Platform: web-based (also on localhost)
License: open source (GNU GPLv2+)
Price/Availability: free
Programming Language(s): Python, C++, Perl
Key features: SERVER INSTALLATION, MANAGE YOUR OWN CORPORA
Website: NoSketchEngine (bonito, manatee, finlib, open-susanne-corpus)
Website: KonText Repository (alternative front end maintained by Czech National Corpus)
Website: SketchEngine (commercial version)
Purpose: Scripts for indexing and querying parallel corpora with CWB.
Version: git commit ee032d5
Date: 17 Dec 2012
Platform: Linux/OSX
License: open source (GNU GPLv3+)
Price/Availability: free
Programming Language(s): Perl
Key features: CWB, CORPUS INDEXING, PARALLEL CORPUS QUERYING
Website: OPUS project page
Website: Uplug Repository
Purpose: Simple web interface for querying (CWB-indexed) parallel corpora.
Version: git commit e985236
Date: 21 May 2015
Platform: Linux/OSX
License: open source (GNU GPLv2+)
Price/Availability: free
Programming Language(s): PHP, XSLT
Key features: ONLINE PARALLEL CONCORDANCER, CQP-QUERY SUPPORT, OPTIMIZED FOR MULTILINGUAL CONCORDANCES IN 3+ LANGUAGES
Website: http://parasolcorpus.org (v1)
Website: Bitbucket Repository (v1)
Website: Short Demo Clip (v1)
Purpose: Simple web interface for querying (CWB-indexed) parallel corpora.
Version: git commit 29600cc
Date: 21 May 2015
Platform: Linux/OSX
License: open source (GNU GPLv2+)
Price/Availability: free
Programming Language(s): PHP, XSLT
Key features: ONLINE PARALLEL CONCORDANCER, CQP-QUERY SUPPORT, SIMPLE INTERFACE FOR 2-3 LANGUAGES, SUPPORT FOR SENTENCE- AND WORD-ALIGNMENT, IMPROVED INSTALLATION AND PRE-PROCESSING INSTRUCTIONS
Website: http://parasolcorpus.org (v2)
Website: Bitbucket Repository (v2)
Purpose: Advanced corpus management system.
Version: stable: 2.33.2-SkE-2.133.6-3.81.2 / beta: 2.33.2-SkE-2.133.6-3.81.6 (finlib/manatee/bonito)
Date: 19 Jan 2016 (last version check; under constant development)
Platform: web-based
License: commercial
Price/Availability: €58/year (academic single-user license, own corpus quota: 1 million words); 30-day free trial
Programming Language(s): various
Key features: WORD SKETCHES, SOPHISTICATED COLLOCATION MEASURES, THESAURUS, PRELOADED BILLION WORD CORPORA FOR MANY LANGUAGES, EASIEST WAY TO CREATE YOUR OWN SYNTACTICALLY ANNOTATED CORPORA
Website: SketchEngine (stable)
Website: SketchEngine (beta)
Website: NoSketchEngine (open source version – reduced functionality)
Purpose: Automatic alignment pipeline for parallel treebanks.
Version: source: 1.4
Date: 10 Dec 2013
Platform: Linux
License: open source (GNU GPLv2+)
Price/Availability: free
Programming Language(s): Python
Key features: AUTOMATIC ALIGNMENT PIPELINE, WRAPPERS FOR HUNALIGN, ZHECHEV TREEALIGNER, EXPORT TO TIGERXML AND TMX
Website: t2t-pipe Repository
Website: 4-minute Screencast
Website: Introduced in Killer, M; Sennrich, R; Volk, M (2011)
Purpose: Web interface to t2t-pipe.
Version: unversioned
Date: Aug 2013
Platform: Linux
License: not specified (contact webmaster(at)langui.ch for source code)
Price/Availability: free
Programming Language(s): PHP
Key features: QUICK TREEBANK GENERATION FROM TWO PARALLEL TEXT FILES
Website: t2t-pipe web interface (created by Matthias Fluor and Michael Amsler)
Website: CLab Lerneinheit (in German, created by Matthias Fluor)
Purpose: Simple concordancing program for monolingual corpora.
Version: binary: 2.9c / source: 3.0 (beta)
Date: 20 Feb 2014 / 2 Sep 2015
Platform: Win/Linux/OSX
License: open source (MIT)
Price/Availability: free
Programming Language(s): Python (GUI: Tk)
Key features: KWIC, SORTING, WORD LISTS, ADD ONLINE WEBPAGES TO CORPUS
Website: TextSTAT (EN)
Website: TextSTAT (DE)
Purpose: Open-source corpus query system.
Version: stable: 3.0 / alpha: 3.5 [r819]
Date: 25 Apr 2010 / 8 Apr 2016
Platform: Win (from v3.1)/Linux/OSX
License: open source (GNU GPLv2+)
Price/Availability: free
Programming Language(s): C++, Perl
Key features: CORPUS MANAGEMENT, POWERFUL QUERY LANGUAGE
Website: CWB Project Home
Purpose: NLP tools for processing (parallel) corpora.
Version: stable: 0.3.8 / source: git commit 775df4a
Date: 16 Mar 2013 / 10 Jan 2016
Platform: Linux/OSX
License: open source (GNU GPLv3+)
Price/Availability: free
Programming Language(s): Perl
Key features: COMPLETE (PARALLEL) CORPUS CREATION PIPELINE
Website: Uplug Repository
Website: OPUS project page
Purpose: Indexing and analysis of large XML corpora.
Version: stable: 1.26
Date: 4 Aug 2010
Platform: Win/Linux/OSX
License: open source (GNU GPLv2+)
Price/Availability: free
Programming Language(s): Perl
Key features: XML-AWARE, INDEXING, CONCORDANCER, DEVELOPED FOR BNC
Website: Xaira Repository (EN)
Website: All about Xaira (OUP – Help page including tutorials)
Purpose: Web-based bitext alignment.
Version: n/a
Date: n/a
Platform: web-based
License: free (up to 5 bitext alignments a day); license not specified
Price/Availability: free service (advertising for commercial software suite)
Programming Language(s): Perl
Key features: SUPPORT FOR A MULTITUDE OF LANGUAGES AND INPUT FORMATS, TMX GENERATION, HTML DOWNLOAD, VERY FAST, EASY TO USE, LIMITED TO 5 BITEXTS A DAY, ALIGNFACTORY
Website: YouAlign