Category Archives: Tools

CQPweb tutorial (German)

Link to Noah Bubenhofer’s CQPweb Tutorial (German)

CQPweb

2023-07-23
  • Developer / Project Head: Andrew Hardie
  • Purpose: web interface to CWB
  • Version: stable 3.2.43; dev 3.3.15 [r1848]
  • Date: 31 Dec 2021 / 23 July 2023
  • Platform: web-based (also on localhost)
  • License: open source, GNU GPLv2+
  • Price/Availability: free
  • Programming language(s): PHP, MySQL, Perl
  • Key features: server installation, manage your own corpora, web interface, CQP queries
  • Website: CQPweb project page
  • Website: CQPweb SVN Repository
  • Website: UCREL Lancaster Corpus Server (free access to a lot of corpus resources after registration, including the extended Brown family of corpora)
  • Website: CQPweb at Beijing Foreign Studies University – large number of publicly accessible corpora (username: test, password: test)
  • Website: CQPweb Video Tutorials
  • Website: CQPwebInABox Video Tutorials

New release: ParaVoz2

ParaVoz2

2015-07-02
  • Developer / Project Head: Ruprecht von Waldenfels
  • Purpose: simple web interface for querying (CWB-indexed) parallel corpora
  • Version: git commit 98bbe99
  • Date: 2 July 2015
  • Platform: Linux/OSX
  • License: open source, GNU GPLv2+
  • Price/Availability: free
  • Programming language(s): PHP, XSLT
  • Key features: online parallel concordancer, CQP query support, simple interface for 2-3 languages, support for sentence and word alignment, improved installation and pre-processing instructions
  • Website: http://parasolcorpus.org (v2)
  • Website: Bitbucket Repository (v2)

Release announcement:


> Date: Thu, 21 May 2015 14:41:13 +0200
> From: ruprecht.waldenfels _(at)_ gmx.net
> To: cwb _(at)_ sslmit.unibo.it
> Subject: [CWB] Interface for parallel corpora
>
> Dear colleagues,
>
> we would like to let you know that a new version of the ParaVoz corpus
> interface for parallel corpora hosted with CWB has been released.
> ParaVoz 2.0 has a user friendly interface, it features basic metadata
> management and supports word alignment.
>
> ParaVoz 2.0 extends (but not replaces) Paravoz 1.0; it is open-source
> and found here: https://bitbucket.org/rvwfels/paravoz2
>
> A demo version is found here: www.parasolcorpus.org/ParaVoz
>
> Best,
> Ruprecht von Waldenfels
> Michał Woźniak
>
> Institute of Polish, Polish Academy of Sciences, Cracow
> _______________________________________________
> CWB mailing list
> CWB _(at)_ sslmit.unibo.it
> http://devel.sslmit.unibo.it/mailman/listinfo/cwb

CQPwebInABox

CQPwebInABox running on VMware Player

Excellent news! A couple of days ago, Andrew Hardie released a virtual machine with a preconfigured version of CQPweb installed:


> From: a.hardie(*at*)lancaster.ac.uk
> To: cwb(*at*)sslmit.unibo.it
> Date: Thu, 2 Apr 2015 05:20:33 +0000
> Subject: [CWB] Announcing CQPwebInABox
>
> Hi everybody,
>
> This is just a quick note to announce the availability of CQPwebInABox
> – a virtual machine image containing a pre-installed copy of CQPweb.
>
> This is designed to get beginners past the hump of having to install
> all the different components.
>
> The image (1.6GB) can be downloaded here:
> https://sourceforge.net/projects/cwb/files/CQPwebInABox/
>
>
> To run it, you will need to install VirtualBox (although I believe
> other virtualisation tools can also use the same file format, I haven’t
> yet tested this).
>
> You can get VirtualBox here:
> https://www.virtualbox.org/wiki/Downloads
> Then “import appliance” from the .ova download.
>
> The virtual machine runs Linux – however, I have set it up in such a
> way as to make the interface as similar to Windows as possible. So
> don’t fear the Linux!
>
> I will create some video tutorials & put them on YouTube as soon as I can.
>
> Feedback welcome.
>
> best
>
> Andrew.

New Release: NoSketchEngine

The Sketch Engine development team has just released a new open-source version of their tools (bonito, manatee, finlib; download links), including the following highlights:

  • extended support for parallel corpora
  • support for virtual corpora
  • asynchronous query processing showing partial results as they are computed
  • corpus info page providing an overall overview of the corpus stats
  • lots of smaller enhancements in the functionality and usability of the user interface
  • lots of speed enhancements, both for run time (query evaluation) and compile time (corpus indexing)
  • lots of bugfixes

Source: http://nlp.fi.muni.cz/trac/noske/wiki/Downloads [accessed: 13/06/2014]

ParaVoz

ParaVoz

2015-05-21
  • Developer / Project Head: Ruprecht von Waldenfels
  • Purpose: simple web interface for querying (CWB-indexed) parallel corpora
  • Version: git commit e985236
  • Date: 21 May 2015
  • Platform: Linux/OSX
  • License: open source, GNU GPLv2+
  • Price/Availability: free
  • Programming language(s): PHP, XSLT
  • Key features: online parallel concordancer, CQP query support, optimized for multilingual concordances in 3+ languages
  • Website: http://parasolcorpus.org (v1)
  • Website: Bitbucket Repository (v1)
  • Website: Short Demo Clip (v1)

AntConc on TurnKey Linux Server

If you try to launch AntConc on a Debian-based 64-bit system, you get the following error message (tested with versions 3.2.4u and 3.4.1u):

./antconc3.2.4u: No such file or directory
or 
./AntConc: No such file or directory

I needed the following steps to start AntConc on a TurnKey Linux Server (Debian 7, 64-bit) over ssh with X11 forwarding enabled (e.g. PuTTY plus Xming on Windows 8.1).

Important note: Please respect Laurence Anthony’s licensing terms and ask for permission before using AntConc in a server/group environment (see README section ‘LEGAL MATTER’ (p. 11) for details).

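1) Activate the i386 architecture on 64-bit systems and refresh the package lists, so that the :i386 packages in step 2 can be found:

dpkg --add-architecture i386
apt-get update
apt-get install libc6-i386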

2) Install missing 32-bit libraries:

apt-get install libx11-6:i386 libxss1:i386 libxft2:i386

For Ubuntu-based systems, see this post; for other Linux distributions, see the ongoing discussion at https://groups.google.com/forum/#!forum/antconc
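
A likely cause of the misleading "No such file or directory" message is that the AntConc binary is a 32-bit executable whose 32-bit loader and libraries are missing on the 64-bit system. The following shell sketch checks for that before anything gets installed. The function name `is_32bit_elf` and the path `./AntConc` are illustrative assumptions, not part of AntConc or Debian; adjust the path to wherever the binary was unpacked.

```shell
#!/bin/sh
# Sketch: decide whether a binary is a 32-bit ELF executable, based on the
# description printed by `file -b`. The function name is hypothetical.
is_32bit_elf() {
    case "$1" in
        *"ELF 32-bit"*) return 0 ;;
        *) return 1 ;;
    esac
}

# Only run the check if the (assumed) binary and the `file` tool exist.
if [ -x ./AntConc ] && command -v file >/dev/null 2>&1; then
    if is_32bit_elf "$(file -b ./AntConc)"; then
        echo "AntConc is a 32-bit binary; 32-bit libraries are required."
        # List the foreign architectures dpkg already knows about.
        dpkg --print-foreign-architectures
    fi
fi
```

If the message is printed and `i386` is not in the list of foreign architectures, steps 1) and 2) above probably apply to your system as well.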

ForBetterEnglish

Classroom-friendly collocations dictionary:

[Last update: 03/06/2015]

http://forbetterenglish.com/ (superseded by SkELL, Sketch Engine for Language Learning)

References:

  • Kilgarriff, A. (2014, March). “Corpora in the classroom without scaring the students.” British Council – EnglishAgenda Seminar. Retrieved from http://www.youtube.com/watch?v=2APIUxE_i6M [Adam’s talk starts at 1:09:35]
  • Kilgarriff, A., Husák, M., McAdam, K., Rundell, M., & Rychlý, P. (2008). “GDEX: Automatically Finding Good Dictionary Examples in a Corpus.” In E. Bernal & J. DeCesaris (Eds.), Proceedings of the 13th EURALEX International Congress (pp. 425–432). Barcelona, Spain: Institut Universitari de Linguistica Aplicada, Universitat Pompeu Fabra. Retrieved from EURALEX 2008

TurnKey virtual appliances

“Turnkey Linux is a virtual appliance library that integrates and polishes the very best open source software into ready to use solutions.”

Excellent base system for CQPweb, ParaVoz, AntWebCorpusFramework, NoSketchEngine, etc.

Source: http://www.turnkeylinux.org/ [accessed: 03/03/2014]

  • LAMP Stack Virtual Appliance (~220MB, Linux base system [Debian 7], administration through a convenient web GUI, accessible from any (local) machine within minutes)