Category Archives: Python

spaCy

Extremely promising new Python NLP tool: spaCy (commercial open-source software):

Unfortunately, it is only able to deal with English input at the moment and installation on Windows seems to be tricky. The project is currently under intense development and it will be interesting to check the following links on a regular basis:

Link to github project

Link to documentation

License: AGPLv3 (free for open-source projects), changed to MIT License (27 Sep 2015)

24-07-2015 15-22-36

 

Source: http://spacy.io/index.html#detailed-speed-comparison [accessed: 24/07/2015]

Installing nltk3 for Python3 in virtual environment on Linux

[Last update: 04/01/2015]

Prerequisites: Install Python and set up virtual environment  & Install pip in virtual environment
Tested on: Xubuntu 12.04 LTS / 13.10 / 14.04 LTS (64-bit) with ActivePython installed in /opt

  • (skip, if activated) Activate the virtual environment you would like to install nltk (v3) in:
  1. cd ~/venv/apy3341
    source bin/activate
  • Use pip to install the latest version of nltk:
  1. (apy3341):~$ pip install -U nltk
  • [January 2015] Workaround for users with setuptools versions >=10 (see NLTK Issue #824):
  1. (apy3341):~$ pip install setuptools==9.1
    (apy3341):~$ pip install -U nltk
  • (skip, if nltk_data already present on system) Start Python interpreter in virtual environment and download nltk_data (corpora, packages & models):
(apy3341) mki@ubuntu:~$ python
ActivePython 3.3.4.1 (ActiveState Software Inc.) based on
Python 3.3.4 (default, Feb 25 2014, 14:59:15) 
[GCC 4.0.2 20051125 (Red Hat 4.0.2-8)] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import nltk
>>> nltk.download()

pip installations of sphinx, numpy & imapclient in virtual environment on Linux work out-of-the-box

Prerequisites: Install Python and set up virtual environment  & Install pip in virtual environment
Tested on: Xubuntu 12.04 LTS / 13.10 / 14.04 LTS (64-bit) with ActivePython installed in /opt

[Last update: 03/07/2014]

  • (skip, if activated) Activate the virtual environment you would like to install additional python packages in:
  1. cd ~/venv/apy3341
    source bin/activate
  • Use pip to install/update additional packages (the installation of the following packages worked out-of-the-box):
  1. bin/pip install -U sphinx
    1. bin/pip install -U numpy
    1. bin/pip install -U imapclient

Installing lxml in virtual environment on Linux

Prerequisites: Install Python and set up virtual environment  & Install pip in virtual environment
Tested on: Xubuntu 12.04 LTS / 13.10 / 14.04 LTS (64-bit) with ActivePython installed in /opt

[Last update: 03/07/2014]

  • (skip, if activated) Activate the virtual environment you would like to install lxml in:
  1. cd ~/venv/apy3341
    source bin/activate
  • Install/check for lxml dependencies (for pip to be able to compile lxml from source code):
  1. sudo apt-get install libxml2-dev libxslt1-dev zlib1g zlib1g-dev
  • Use pip to install/update lxml:
  1. bin/pip install -U lxml

Installing ipython in virtual environment on Linux

Prerequisites: Install Python and set up virtual environment  & Install pip in virtual environment
Tested on: Xubuntu 12.04 LTS / 13.10 / 14.04 LTS (64-bit) with ActivePython installed in /opt

[Last update: 03/07/2014]

  • (skip, if activated) Activate the virtual environment you would like to install ipython in:
  1. cd ~/venv/apy3341
    source bin/activate
  • Make sure that readline support is installed in your virtual environment (prerequisite for ipython’s colour prompts).
  • Use pip to install/update readline:
  1. bin/pip install -U ipython

Installing readline support in ActivePython virtual environment on Linux

Prerequisites: Install Python and set up virtual environment  & Install pip in virtual environment
Tested on: Xubuntu 12.04 LTS / 13.10 / 14.04 LTS (64-bit) with ActivePython installed in /opt

[Last update: 03/07/2014]

  • (skip, if activated) Activate the virtual environment you would like to install readline in:
  1. cd ~/venv/apy3341
    source bin/activate
  • Install/check for readline dependencies (for pip to be able to compile readline from source code):
  1. sudo apt-get install libncurses5-dev
  • Use pip to install/update readline:
  1. bin/pip install -U readline