This presentation is available under the Creative Commons
Attribution-ShareAlike 3.0 Unported License. Please refer to
http://www.bits.vib.be/ if you use this presentation or parts
hereof.
Introduction to Linux for
bioinformatics
Getting software
Joachim Jacob
5 and 12 May 2014
Figure:
http://worldofdtcmarketing.com/website-content-now-critical-for-search-engine-performance-pharma/health-information-online/

Software for Linux

Just like Linux, most software written for Linux is
open source and free (e.g. GNU software).

Depending on your chosen distribution, installing
packages (another name for software) is easier or
harder.
http://www.makeuseof.com/pages/best-linux-software

Installing: use the software center
PREFERRED WAY to install
Software center (= 'app' or 'application' store):
every distribution has some kind of software manager.
Search for software and click to install. The software
is automatically kept up to date by the update manager.
Depending on the distro, a lot of bioinformatics
packages are available.
Example in Linux Mint

E.g. Bowtie via software center

Software is fetched from repositories
http://en.wikipedia.org/wiki/Software_repository
https://help.ubuntu.com/community/Repositories/Ubuntu
On the internet, some URLs point to software repositories for
Linux distributions. You can plug repos into the software manager.
A repository hosts installation files for software. These are
typically .rpm (Red Hat-like distros) or .deb (Debian-like distros) files.
Official repositories are curated by the distribution: the safest,
best-tested source of software.
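
The repositories a Debian-like system knows about can also be inspected from the terminal. The following is only a sketch: it assumes the classic apt file locations (/etc/apt/sources.list and /etc/apt/sources.list.d/), the function name list_apt_repos is made up for this example, and newer releases may store the same information in a different file format. On other distros it simply prints nothing.

```shell
# Hypothetical helper: print the repository lines known to apt.
list_apt_repos() {
    for f in /etc/apt/sources.list /etc/apt/sources.list.d/*.list; do
        # Skip paths that do not exist (e.g. on Red Hat-like systems).
        [ -f "$f" ] || continue
        # Lines starting with "deb " point to a repository of installable packages.
        grep -H '^deb ' "$f" || true
    done
}

list_apt_repos
```

Each printed line shows the file it came from, followed by the URL of a repository.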

Example: the Debian Med repo
→ exercise at the end of this section
Debian Med is a repository containing a lot of
bioinformatics packages for Debian-like distros, such as
Debian, Ubuntu, Mint,...
The Debian Med repository is added here as a PPA-type repo: a Personal
Package Archive (PPA), hosted on Launchpad. The reference to such a
repo starts with ppa: and has the form ppa:owner/archive.

Downloading installation files
→ PREFERRED WAY number 2
Instead of searching with the Software Manager,
the installation files (.rpm or .deb) can be
downloaded from the internet separately (e.g. when
they are not (yet) in a repository). The Software
Manager will install the software contained in
these files (usually after double-clicking the install
file).
Note: the download may not be secured and the package is
not verified by a repository, so be a bit more careful.

Dependencies
Unix philosophy: software interconnects
Software should do one specific task, avoiding
redundant code by reusing code from other software. This
creates dependencies between packages.
Figure: graphical representation of the dependencies of a package.
Dependencies need to be co-installed with the software
if not present. .rpm/.deb files list their dependencies, so the
software manager can take care of this!
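
The dependencies recorded for a package can be listed from the terminal as well. A small sketch for Debian-like distros, using apt-cache depends; the helper name show_deps is invented for this example, and the package altree is the one from the dependency example in this deck. Whether the package is found depends on the repositories configured on your machine.

```shell
# Hypothetical helper: list the packages a given package depends on.
show_deps() {
    if command -v apt-cache >/dev/null 2>&1; then
        # apt-cache reads the package lists downloaded from the repositories.
        apt-cache depends "$1" 2>/dev/null \
            || echo "package '$1' is not known to the configured repositories"
    else
        echo "apt-cache not found: this is not a Debian-like system"
    fi
}

show_deps altree
```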

Dependency example
http://packages.debian.org/squeeze/altree

Exercise: example of installing a .deb file
http://packages.debian.org/squeeze/altree
→ Exercise link: 2 exercises!
1. install the multiple sequence alignment tool ClustalW.
2. install Gk-arrays

Software center versus package manager
A package manager allows more fine-tuned package installs and gives
more info (e.g. repo info, libraries with code to be shared between programs).
A Software Manager offers installation of complete sets of packages
constituting one (often GUI) tool. It also has a user rating functionality.

Uninstalling software
If software was installed through your software manager, you
can also remove it from within its interface.
The software manager knows exactly where the files have
been installed.

Software that is not packaged
→ LESS PREFERRED way number 1
Software can come as a compressed file, which contains
source code. This 'human readable' source code needs to
be compiled before it can be used. Compiling creates
executable machine code (a binary) which you can run.
Software accompanying scientific papers is often distributed in this
form (before eventually being incorporated in a repository).

Usually, the source code comes as a .tar.gz or .tar.bz2 compressed file.

Compiling: a process that is carried out via the terminal (see later)
http://www.micans.org/mcl/ and http://sourceforge.net/projects/transpose/
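
To make the unpack-and-compile idea concrete, here is a toy sketch, not the build procedure of any real package. It creates its own tiny stand-in archive (hello-1.0.tar.gz, a made-up name) in a throwaway directory, unpacks it and compiles it if a C compiler is present. Real source packages ship their own build instructions (usually a README or INSTALL file); follow those.

```shell
# Work in a throwaway directory so nothing on the system is touched.
workdir=$(mktemp -d)
cd "$workdir"

# Stand-in for a downloaded source archive: one tiny C file, packed as .tar.gz.
mkdir hello-1.0
cat > hello-1.0/hello.c <<'EOF'
#include <stdio.h>
int main(void) { puts("hello from source"); return 0; }
EOF
tar -czf hello-1.0.tar.gz hello-1.0
rm -r hello-1.0

# Step 1: unpack the archive (use -xjf instead of -xzf for a .tar.bz2 file).
tar -xzf hello-1.0.tar.gz

# Step 2: compile the source into a binary, only if a C compiler is installed.
if command -v cc >/dev/null 2>&1; then
    cc -o hello-1.0/hello hello-1.0/hello.c && ./hello-1.0/hello
else
    echo "no C compiler found; install one via the software manager first"
fi
```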

Software that is not packaged
Issues:
- You need to install the dependencies yourself (best via the software manager).
- Be organised: in which folder will you put the software?
- After the software is compiled, you need to make it available in your system (see later).
- It is not easy to delete the software: you need to manually remove the files everywhere you have put them.
Summary: a lot of hassle!
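
One common way to make a manually installed program available is to put it in a folder that is listed in the PATH variable. The sketch below illustrates the idea with a throwaway folder and a made-up program called mytool; in practice you would pick a permanent folder and add the PATH line to your shell start-up file.

```shell
# Hypothetical personal folder for manually installed programs.
mybin=$(mktemp -d)
printf '#!/bin/sh\necho "found via PATH"\n' > "$mybin/mytool"
chmod u+x "$mybin/mytool"

# Prepend the folder to PATH, for this terminal session only.
PATH="$mybin:$PATH"
export PATH

# The shell now finds the program by name, without the full path.
result=$(mytool)
echo "$result"

# Uninstalling is manual too: delete the files you put there yourself.
rm "$mybin/mytool"
```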

Software that is not packaged
→ LESS PREFERRED way number 2
Sometimes, software is compiled for you, and the
resulting binary file can be downloaded.
Attention: the binary needs to have been built for
your machine architecture (usually 64 bit).
Example: http://homepage.tudelft.nl/19j49/t-SNE.html clustering tool
Figure labels: a folder called 'bin' contains executable binary files (the program files); the binary file.
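
To find out which architecture your own machine has, you can ask the system. A minimal sketch using the standard uname command; the variable name arch is only for illustration.

```shell
# Print the machine architecture; x86_64 indicates a 64-bit Intel/AMD system.
arch=$(uname -m)
echo "This machine reports: $arch"
```

Compare the reported value with the architecture named on the download page of the binary.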

How to run binaries?

As a safety measure, downloaded binaries normally cannot be run
right away. You first need to set the permission
of the file to 'executable'. An easy way to do this:
right-click the file → Properties.

If your program is designed to work in a graphical
environment, double-clicking the binary launches
the program.
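
The same permission can be set in the terminal with chmod. A small sketch, assuming a made-up program file called mytool (a two-line shell script stands in for a downloaded binary here):

```shell
workdir=$(mktemp -d)
tool="$workdir/mytool"          # hypothetical downloaded program
printf '#!/bin/sh\necho "mytool runs"\n' > "$tool"

# A freshly created or downloaded file usually lacks the execute permission.
[ -x "$tool" ] || echo "not executable yet"

# Grant execute permission to the owner of the file, then run it.
chmod u+x "$tool"
"$tool"
```

Here u+x means: add (+) the execute permission (x) for the user who owns the file (u).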

A program is a binary file
● Every program, whether installed through packages or
installed manually, including those on your desktop, can be
started on the command line in the terminal.
● Some binaries, and most bioinformatics
programs, can ONLY be run from the command
line (covered later in detail).

Java programs are binary files
● Some bioinformatics programs come as a Java archive
(compiled Java code): program.jar
● To run this, double-click, or type:
$ java -jar program.jar
● Example: Picard tools (http://picard.sourceforge.net)

Why different methods of distributing?

Commercial software is usually binary only
(to protect the source code); installation
instructions are provided by the vendor.

Free (Open Source) Software is usually
distributed as source packages.

Packaging, the process of creating .rpm or .deb
packages, takes a lot of time; therefore often only the
source code is provided, for you to compile yourself.

Scripts are human readable programs

Some software (an interpreter) reads in text files containing
instructions and has the computer execute them.
These text files are called scripts. They are not
binary files, but they are executable.

E.g. interpreters: perl, python, R, bash
Figure label: a script, which contains just text and can be interpreted by perl

Scripts are human readable programs
Scripting languages are very popular in bioinformatics
because of their relatively low barrier to getting started,
their platform independence, their quick-and-dirty
approach (in the case of Perl), and easy sharing: just
download the script and execute it (permissions needed: read
and execute).
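
A short sketch of what this looks like in practice, using a shell script because a shell is present on every Linux system; the script content and the variable names are invented for this example. It shows that a script is plain text, and the two usual ways to run it.

```shell
script=$(mktemp)
cat > "$script" <<'EOF'
#!/bin/sh
# The first line names the interpreter that should read this file.
echo "hello from a script"
EOF

# A script is plain text: cat shows exactly what will be executed.
cat "$script"

# Way 1: hand the file to the interpreter explicitly (read permission suffices).
via_interpreter=$(sh "$script")

# Way 2: add execute permission and start the script like any other program.
chmod u+x "$script"
direct=$("$script")

echo "$via_interpreter / $direct"
```

The same pattern applies to perl, python or R scripts, with the matching interpreter named on the first line.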

Summary of this section

A software manager...

… taps software from different places,
called repositories.

You can add repositories by adding their
URL.

Software is installed on your machine in
compiled form, either by the manager, or
manually.

… or it consists of scripts, which are
interpreted by an interpreter at run time.

Keywords of this section
software center
package manager
Repositories / software sources
dependencies
RPM and deb files
compiling of source code
binaries
packaging
executable
scripts
Write in your own words what the terms mean

Exercise: Getting software
→ Adding software sources containing bioinformatics packages
→ Install software from a package file graphically.
→ Install the very good text editor Geany (PPA exercise)
→ Install the very good terminal program Terminator (Software
center)
→ Good bioinformatics packages: Ugene – interface to many
algorithms.