Category Archives: Tools

All existing entries in the corpus tool database have been updated

After a long rather inactive period on langui.ch, all the corpus tool entries in the database have been updated. The following sort order should enable visitors to figure out which projects are still actively developed and which projects are no longer maintained:

Focus week: Datavisualisation & Corpus Linguistics

Class: I1b (KSH)

17 April 2018 – 20 April 2018

Fünf in der Sprachwissenschaft häufig anzutreffende Visualisierungstypen: Liste, Karte, Partitur, Vektoren, Graph/Netz (gerichtet: Baum; ungerichtet: Netz) [Bubenhofer/Kupietz 2018: 46]

CQPweb tutorial (German)

Noah Bubenhofer's CQPweb Tutorial (German)

Noah Bubenhofer’s CQPweb Tutorial (German)

Linkt to Noah Bubenhofer’s CQPweb Tutorial (German)

CQPweb

2024-11-06 Developer / Project Head: Andrew Hardie
Purpose/Version/Date web interface to cwb stable: 3.2.43 dev: 3.3.15 6 November 2024 Platform/License web-based (also on localhost) open source License: GNU GPLv2+ Price/Availability free Programming Language(s): php, mysql, Perl Key features: SERVER INSTALLATION, MANAGE YOUR OWN CORPORA, WEB INTERFACE, CQP QUERIES Website: CQPweb project page Website: CQPWeb SVN Repository Website: UCREL Lancaster Corpus Server (free access to a lot of corpus resources after registration, including the extended Brown-family of corpora) Website: CQPweb at Beijing Foreign Studies University – Large Number of publicly accessible corpora (username: test, password: test) Website: CQPweb Video Tutorials Website: CQPwebInABox Video Tutorials
Return to top.

Related posts on langui.ch:

New release: ParaVoz2

ParaVoz2

2015-07-02 Developer / Project Head: Ruprecht von Waldenfels
Purpose/Version/Date Simple web interface for querying (cwb-indexed) parallel corpora. git-commit: 98bbe99 2 July 2015 Platform/License Linux/OSX open source License: GNU GPLv2+ Price/Availability free Programming Language(s): PHP, XSLT Key features: ONLINE PARALLEL CONCORDANCER, CQP-QUERY SUPPORT, SIMPLE INTERFACE FOR 2-3 LANGUAGES, SUPPORT FOR SENTENCE- AND WORD-ALIGNMENT, IMPROVED INSTALLATION AND PRE-PROCESSING INSTRUCTIONS Website: http://parasolcorpus.org (v2) Website: Bitbucket Repository (v2)
Return to top.

Release announcement:


> Date: Thu, 21 May 2015 14:41:13 +0200
> From: ruprecht.waldenfels _(at)_ gmx.net
> To: cwb _(at)_ sslmit.unibo.it
> Subject: [CWB] Interface for parallel corpora
>
> Dear colleagues,
>
> we would like to let you know that a new version of the ParaVoz corpus
> interface for parallel corpora hosted with CWB has been released.
> ParaVoz 2.0 has a user friendly interface, it features basic metadata
> management and supports word alignment.
>
> ParaVoz 2.0 extends (but not replaces) Paravoz 1.0; it is open-source
> and found here: https://bitbucket.org/rvwfels/paravoz2
>
> A demo version is found here: www.parasolcorpus.org/ParaVoz
>
> Best,
> Ruprecht von Waldenfels
> Michał Woźniak
>
> Institute of Polish, Polish Academy of Sciences, Cracow
> _______________________________________________
> CWB mailing list
> CWB _(at)_ sslmit.unibo.it
> http://devel.sslmit.unibo.it/mailman/listinfo/cwb

Related posts on langui.ch:

A (brief) history of computerised corpus tools by Mura Nava

Source: http://www.perezparedes.es/big-data-and-corpus-linguistics [accessed: 14/05/2015]

CQPwebInABox

CQPwebInABox running on VMware Player

CQPwebInABox running on VMware Player

Excellent news! A couple of days ago, Andrew Hardie released a virtual machine with a preconfigured version of CQPweb installed:


> From: a.hardie(*at*)lancaster.ac.uk
> To: cwb(*at*)sslmit.unibo.it
> Date: Thu, 2 Apr 2015 05:20:33 +0000
> Subject: [CWB] Announcing CQPwebInABox
>
> Hi everybody,
>
> This is just a quick note to announce the availability of CQPwebInABox
> – a virtual machine image containing a pre-installed copy of CQPweb.
>
> This is designed to get beginners past the hump of having to install
> all the different components.
>
> The image (1.6GB) can be downloaded here:
> https://sourceforge.net/projects/cwb/files/CQPwebInABox/
>
>
> To run it, you will need to install VirtualBox (although I believe
> other virtualisation tools can also use the same file format, I haven’t
> yet tested this).
>
> You can get VirtualBox here:
> https://www.virtualbox.org/wiki/Downloads
> Then “import appliance” from the .ova download.
>
> The virtual machine runs Linux – however, I have set it up in such a
> way as to make the interface as similar to Windows as possible. So
> don’t fear the Linux!
>
> I will create some video tutorials & put them on YouTube as soon as I can.
>
> Feedback welcome.
>
> best
>
> Andrew.

CQPweb

2024-11-06 Developer / Project Head: Andrew Hardie
Purpose/Version/Date web interface to cwb stable: 3.2.43 dev: 3.3.15 6 November 2024 Platform/License web-based (also on localhost) open source License: GNU GPLv2+ Price/Availability free Programming Language(s): php, mysql, Perl Key features: SERVER INSTALLATION, MANAGE YOUR OWN CORPORA, WEB INTERFACE, CQP QUERIES Website: CQPweb project page Website: CQPWeb SVN Repository Website: UCREL Lancaster Corpus Server (free access to a lot of corpus resources after registration, including the extended Brown-family of corpora) Website: CQPweb at Beijing Foreign Studies University – Large Number of publicly accessible corpora (username: test, password: test) Website: CQPweb Video Tutorials Website: CQPwebInABox Video Tutorials
Return to top.

Related posts on langui.ch:

New Release: NoSketchEngine

The SketchEngine development team has just released a new open-source version of their tools (bonito, manatee, finlibDownload-Links), including the following highlights:

  • extended support for parallel corpora
  • support for virtual corpora
  • asynchronous query processing showing partial results as they are computed
  • corpus info page providing an overall overview of the corpus stats
  • lots of smaller enhancements in the functionality and usability of the user interface
  • lots of speed enhancements, both for run time (query evaluation) and compile time (corpus indexing)
  • lots of bugfixes

Source: http://nlp.fi.muni.cz/trac/noske/wiki/Downloads [accessed: 13/06/2014]

ParaVoz

ParaVoz

2015-05-21 Developer / Project Head: Ruprecht von Waldenfels
Purpose/Version/Date Simple web interface for querying (cwb-indexed) parallel corpora. git-commit: e985236 21 May 2015 Platform/License Linux/OSX open source License: GNU GPLv2+ Price/Availability free Programming Language(s): PHP, XSLT Key features: ONLINE PARALLEL CONCORDANCER, CQP-QUERY SUPPORT, OPTIMIZED FOR MULTILINGUAL CONCORDANCES IN 3+ LANGUAGES Website: http://parasolcorpus.org (v1) Website: Bitbucket Repository (v1) Website: Short Demo Clip (v1)
Return to top.

Related posts on langui.ch:

AntConc on TurnKey Linux Server

If you try to launch AntConc on a Debian-based 64-bit system, you get the following error message (tested with versions 3.2.4u and 3.4.1u):

./antconc3.2.4u: No such file or directory
or 
./AntConc: No such file or directory

The following steps were necessary for me to be able to start AntConc on a TurnKey Linux Server (Debian7, 64-bit) using ssh with X11-forwarding enabled (e.g. PuTTY plus Xming on Windows 8.1).

Important note: Please respect Laurence Anthony’s licensing terms and ask for permission before using AntConc in a server/group environment (see README section ‘LEGAL MATTER’ (p. 11) for details).

1) Activate i386 architecture on 64-bit systems:

apt-get install libc6-i386
dpkg --add-architecture i386

2) Install missing 32-bit libraries:

apt-get install libx11-6:i386 libxss1:i386 libxft2:i386

For Ubuntu-based systems see this post, for other Linux distributions see ongoing discussion on: https://groups.google.com/forum/#!forum/antconc

ForBetterEnglish

Class-room friendly collocations dictionary:

[Last update: 03/06/2015]

http://forbetterenglish.com/ (superseded by SkELLSketch Engine for Language Learning)

12-03-2014 14-55-48

 

12-03-2014 14-56-42

References:

  • Kilgarriff, A. (2014, March). “Corpora in the classroom without scaring the students.” British Council – EnglishAgenda Seminar. Retrieved from http://www.youtube.com/watch?v=2APIUxE_i6M [Adam’s talk starts at 1:09:35]
  • Adam Kilgarriff, Miloš Husák, Katy McAdam, Michael Rundell, Pavel Rychlý (2008). “GDEX: Automatically Finding Good Dictionary Examples in a Corpus.” In Elisenda Bernal, Janet DeCesaris (Ed.), Proceedings of the 13th EURALEX International Congress (pp. 425–432). Barcelona, Spain: Institut Universitari de Linguistica Aplicada, Universitat Pompeu Fabra. Retrieved from EURALEX 2008