SlideShare a Scribd company logo
Introduction to Data
Collection & General
Transit Feed
Specification
University of Thessaly,2015
GreenYourMove Project
With the contribution of the LIFE programme of the European Union - LIFE14 ENV/GR/000611
The Big picture
2
How are GTFS produced?
3
Contents of this presentation
 Definitions and Terminology of:
 Stop, Routes, Trips, Layers/Shapefiles, CRS, layer features, csv files
 OpenStreetMap
 General Transit Feed Specification(GTFS)
 QuantumGIS(QGIS) & OpenLayers Plugin
 PostgreSQL, PostGIS & Shapefile and DBF Loader Exporter
 shp2GTFS
 Basic usage of:
 QGIS
 PostgreSQL & PostGIS
 shp2GTFS
 Conclusion and final results
4
Definitions and Terminology
 Stop: An object that is defined by a geographic point on Earth and a name/ID, and is usually
served/contained in public transit route.
 Route: A sequence of stops that are served by a specific vehicle one after another. Information
in a route are geospatial objects with IDs and are time-independent .
 Trips: Trips are time-dependent executions of routes and they are grouped according to which
route they belong to.
 Layer/Shapefile: These are two concepts that are interconnected. A shapefile is a data format
for GIS systems, which describes vector features, such as points, lines and polygons. When they
are used in QGIS they are represented as layers on the map.
 CRS: Coordinate Reference System is a system used to locate geographical entities. We will
mostly use WGS84 and WGS84/Pseudo Mercator.
 QGIS: In QGIS we will create layers, which will contain points and lines and fall under the
generic category of layer features.
 CSV file: Means Comma-Seperated Values file and is a text file in which every line represents a
data record. Each record consists of one or more fields, seperated by commas.
5
Definitions and Terminology
OpenStreetMap – website https://www.openstreetmap.org/
Definition: As stated on their website: “OpenStreetMap is a free geographic data.
OpenStreetMap built by volunteers largely from scratch and released with an
open-content license.”
Why do we need it?
It provides us with datasets about the road network. In most of the cases the raw
data from OpenStreetMap needs to be edited in some way in order to use in an
application.
6
Definitions and Terminology
General Transit Feed Specification(GTFS) – website
https://developers.google.com/transit/gtfs/
Definition: As stated on their website: “The General Transit Feed Specification
(GTFS) defines a common format for public transportation schedules and
associated geographic information.”
Why do we need it?
Because it will be the data format we will use to represent the transit network. It is
the mainstream approach when it comes to schedule-based datasets. Its
development is supported by big organizations and the number of data available
in this form is really bigger than in any other form.
7
Definitions and Terminology
QuantumGIS(QGIS) – website http://www.qgis.org/
Definition: It is an open-source Geographic Information System(GIS) that is used
for viewing, editing and analyzing geographic data.
Why do we need it?
We use QGIS for data collection. By exploiting the schedule-based nature of a
transit network, we create layers which contain all the geographical information of
a route. Taking for granted that we somehow know the longitude, latitude and id
of a station in a route, we can create a layer containing stops of a route. Next, we
use QGIS to create another layer with the data of the actual path the vehicle
follows. The output file format are shapefiles. When we open shapefiles with a GIS
system they are represented as layers.
8
Definitions and Terminology
PostgreSQL, PostGIS & Shapefile and DBF Loader Exporter -
http://www.postgresql.org/ , http://postgis.net/
Definition: PostgreSQL is an open-source, object-relational database system, while
PostGIS is an extension for that database which adds support for geographic
objects and allows to run queries for geographic related content in SQL. DBF
Loader Exporter is a simple tool for loading files into the database.
Why do we need them?
It is where we store our data. Furthermore, it allows to run queries to analyze and
edit our data. In combination with Python’s modules they are the two main tools
that we used in our script shp2GTFS.
9
Definitions and Terminology
shp2GTFS – not available on website yet.
Definition: It is a Python-based script we created that enables us to
transform shapefiles to a GTFS feed with less effort than any suggested
approach that we are aware of.
Why do we need it?
Because we need to convert the data we gathered with QGIS to GTFS
feeds. There were no tools available online that would enable us to
combine the simplicity of gathering data with QGIS and transform the
produced layers/shapefiles to a GTFS feed.
10
Basic Usage - QGIS
11
The interface:
Basic Toolbar, contains
most of the tools we
need
and will use
In this box we can see the layers have opened on our canvas.
Lon,Lat & CRS information
Canvas
Basic Usage - QGIS
From the basic toolbar we will need:
12
Add feature
(Point or Line)
Used to open a map
layer(plugin)
Select and deselect a
feature
Basic Usage - QGIS
About the Plugin and the creation of a layer
13
Basic Usage - QGIS
Note that:
 When we create and save a shapefile to a folder in our computer, there
are several files created with it. We only care about the shapefile, but
we want to keep the other files too, because they contain information
about CRS, encoding, etc. Opening and editing the shapefile only is
enough and other files change accordingly.
 We are always have to open and work on a OpenStreetMap map layer
and not use other services like Bing Maps or Google Maps.
14
Basic Usage – PostgreSQL - PostGIS
We use the software through a graphic user interface called pgAdmin 3.
15
pgAdmin’s icon:
Basic Usage – PostgreSQL - PostGIS
16
In order to load files onto the
Database we use the a plugin:
After entering username, password
and database name we can connect
and load shapefiles onto the database.
Basic Usage – PostgreSQL - PostGIS
*Note that:
When installing the database software we enter username and password.
We used same username and password for all of the PCs we used.
Also before using the loader software you should add to “Path” System
Variable the path to the location of the “bin” folder in Postgres’s
installation folder.
(The path on my computer is: C:Program FilesPostgreSQL9.3bin )
17
Basic Usage – shp2GTFS
shp2GTFS is a tool we created and is still in development phase. Its usage
is based around the keyfiles. Keyfiles are csv files that contain
information about the timetable of the trips and other logic that needs to
be included in a GTFS feed.
Actually, it connects to the database using a python module, retrieves the
data we ask it to retrieve, and combining them with the information on the
keyfiles it produces the GTFS files one by one.
18
Basic Usage – shp2GTFS
*Note that:
Using shp2GTFS is achievable through Windows Command Prompt and
Cygwin.
Before using shp2GTFS you should set System Variable “client_encoding”
to “UTF-8”.
All input and output paths in the script need to be changed before using it
on another computer or you need to have the exact same folder setup on
all computers.
19
Thank you for your attention!
For more information on our project, visit:
http://www.greenyourmove.org/
Or contact us at:
dimrizopoulos@gmail.com
saharidis@gmail.com
info@greenyourmove.eu
20

More Related Content

PPTX
LIFE GreenYourMove Project - GTFS data
ODP
Multilevel Audio Descriptors @WWW09 develtrack
ODP
ckan 2.0 Introduction (20140618 updated)
PPTX
An Approach for the Incremental Export of Relational Databases into RDF Graphs
PDF
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
PPTX
Incremental Export of Relational Database Contents into RDF Graphs
PDF
Towards a More Efficient Paradigm of Storing and Querying Spatial Data on the...
LIFE GreenYourMove Project - GTFS data
Multilevel Audio Descriptors @WWW09 develtrack
ckan 2.0 Introduction (20140618 updated)
An Approach for the Incremental Export of Relational Databases into RDF Graphs
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
Incremental Export of Relational Database Contents into RDF Graphs
Towards a More Efficient Paradigm of Storing and Querying Spatial Data on the...

What's hot (20)

PDF
Unit3 slides
ODP
ckan 2.0 Introduction (20140522 updated)
PDF
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
PPTX
RDF-Gen: Generating RDF from streaming and archival data
ODP
CKANCon 2016 & IODC16
PDF
Adventures in Linked Data Land (presentation by Richard Light)
PPTX
EuroGeographics Linked Data by geo.admin.ch /swisstopo
ODP
將 Open Data 放上 Open Source Platforms: 開源資料入口平台 CKAN 開發經驗分享
PDF
Graph Analytics with ArangoDB
ODP
Geo2tag LBS platform training at FRUCT12
PDF
Culture Geeks Feb talk: Adventures in Linked Data Land
PPT
Data Grid Taxonomies
PPT
Tx well data final
PPT
Ccp4 mmdb-python
PDF
Apache Spark — Fundamentals and MLlib
PDF
20090925 HTML5の過去、現在、未来
PDF
RDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
PPT
Summary of HDF-EOS5 Files, Data Model and File Format
PDF
Graph basedrdf storeforapachecassandra
PDF
Datomic rtree-pres
Unit3 slides
ckan 2.0 Introduction (20140522 updated)
Geospatial querying in Apache Marmotta - ApacheCon Big Data Europe 2015
RDF-Gen: Generating RDF from streaming and archival data
CKANCon 2016 & IODC16
Adventures in Linked Data Land (presentation by Richard Light)
EuroGeographics Linked Data by geo.admin.ch /swisstopo
將 Open Data 放上 Open Source Platforms: 開源資料入口平台 CKAN 開發經驗分享
Graph Analytics with ArangoDB
Geo2tag LBS platform training at FRUCT12
Culture Geeks Feb talk: Adventures in Linked Data Land
Data Grid Taxonomies
Tx well data final
Ccp4 mmdb-python
Apache Spark — Fundamentals and MLlib
20090925 HTML5の過去、現在、未来
RDF4U: RDF Graph Visualization by Interpreting Linked Data as Knowledge
Summary of HDF-EOS5 Files, Data Model and File Format
Graph basedrdf storeforapachecassandra
Datomic rtree-pres
Ad

Similar to Presentation data collection and gtfs (20)

PDF
Integrating PostGIS in Web Applications
PPTX
PDF
GIS and QGIS training notes
PPTX
123_BASICSAND FUNDAMENTASL OF gisrs.pptx
PPTX
Introduction to GIS
PPT
GIS_Whirlwind_Tour.ppt
PPT
GIS_Whirlwind_Tour.ppt
PPT
GIS_Whirlwind_Tour.ppt
PPT
GIS_Whirlwind_Tour.ppt
PPTX
GIS Level 1 Introduction to GIS and Mapping
PPT
Introduction to Geographic Information System (GIS)
PDF
GIS for Recorders
PDF
Smash & Geopaparazzi - State of the art 2021
PPT
GIS.ppt excellent exposure in only 25 slides
PDF
pecepcion remota y sistemas de informacion geografica
PPTX
JRC, GHA Live presentation: Maps, GISs and SDIs
PDF
Saving Money with Open Source GIS
PPT
introduction to _Quantum Geographical Information System.ppt
PPT
The development of a Geographic Information System for traffic route planni...
PDF
QGIS Open Source Desktop GIS
Integrating PostGIS in Web Applications
GIS and QGIS training notes
123_BASICSAND FUNDAMENTASL OF gisrs.pptx
Introduction to GIS
GIS_Whirlwind_Tour.ppt
GIS_Whirlwind_Tour.ppt
GIS_Whirlwind_Tour.ppt
GIS_Whirlwind_Tour.ppt
GIS Level 1 Introduction to GIS and Mapping
Introduction to Geographic Information System (GIS)
GIS for Recorders
Smash & Geopaparazzi - State of the art 2021
GIS.ppt excellent exposure in only 25 slides
pecepcion remota y sistemas de informacion geografica
JRC, GHA Live presentation: Maps, GISs and SDIs
Saving Money with Open Source GIS
introduction to _Quantum Geographical Information System.ppt
The development of a Geographic Information System for traffic route planni...
QGIS Open Source Desktop GIS
Ad

More from LIFE GreenYourMove (17)

PPTX
Gym 7th research activity day
PPTX
Timetable synchronization
PPTX
Benders Decomposition
PPTX
co-modal emission calculation and inventory-presentation
PPTX
ESCC2018, Mykonos, Greece, June 4-8, 2018, presentation by Rizopoulos D, Saha...
PPTX
ESCC2018, Mykonos, Greece, June 4-8, 2018, presentation by Fragkogios A., Sah...
PPTX
LIFE GreenYourMove Project
PPT
ESCC 2016, July 10-16, Athens, Greece
PPTX
ESCC 2016, July 10-16, Athens, Greece
PPTX
LIFE GYM 5th Hellenic forum for science technology and innovation
PPTX
LIFE GreenYourMove project: 1st workshop
PPTX
Presentation escc 2016
PPTX
Presentation 3rd CSUM
PPTX
Conference Volos
PPTX
3rd Conference on Sustainable Urban Mobility
PPTX
GreenYourMove 1st workshop
PPTX
GreenYourMove Presentation
Gym 7th research activity day
Timetable synchronization
Benders Decomposition
co-modal emission calculation and inventory-presentation
ESCC2018, Mykonos, Greece, June 4-8, 2018, presentation by Rizopoulos D, Saha...
ESCC2018, Mykonos, Greece, June 4-8, 2018, presentation by Fragkogios A., Sah...
LIFE GreenYourMove Project
ESCC 2016, July 10-16, Athens, Greece
ESCC 2016, July 10-16, Athens, Greece
LIFE GYM 5th Hellenic forum for science technology and innovation
LIFE GreenYourMove project: 1st workshop
Presentation escc 2016
Presentation 3rd CSUM
Conference Volos
3rd Conference on Sustainable Urban Mobility
GreenYourMove 1st workshop
GreenYourMove Presentation

Recently uploaded (20)

PPT
Occupational Health and Safety Management System
PDF
Visual Aids for Exploratory Data Analysis.pdf
PPTX
6ME3A-Unit-II-Sensors and Actuators_Handouts.pptx
PDF
Accra-Kumasi Expressway - Prefeasibility Report Volume 1 of 7.11.2018.pdf
PPTX
Software Engineering and software moduleing
PDF
SMART SIGNAL TIMING FOR URBAN INTERSECTIONS USING REAL-TIME VEHICLE DETECTI...
PPTX
Fundamentals of safety and accident prevention -final (1).pptx
PDF
BIO-INSPIRED ARCHITECTURE FOR PARSIMONIOUS CONVERSATIONAL INTELLIGENCE : THE ...
PDF
Soil Improvement Techniques Note - Rabbi
PDF
Influence of Green Infrastructure on Residents’ Endorsement of the New Ecolog...
PPTX
AUTOMOTIVE ENGINE MANAGEMENT (MECHATRONICS).pptx
PPT
Total quality management ppt for engineering students
PPTX
Artificial Intelligence
PDF
22EC502-MICROCONTROLLER AND INTERFACING-8051 MICROCONTROLLER.pdf
PDF
August 2025 - Top 10 Read Articles in Network Security & Its Applications
PDF
COURSE DESCRIPTOR OF SURVEYING R24 SYLLABUS
PDF
A SYSTEMATIC REVIEW OF APPLICATIONS IN FRAUD DETECTION
PDF
Artificial Superintelligence (ASI) Alliance Vision Paper.pdf
PPTX
Nature of X-rays, X- Ray Equipment, Fluoroscopy
PPTX
introduction to high performance computing
Occupational Health and Safety Management System
Visual Aids for Exploratory Data Analysis.pdf
6ME3A-Unit-II-Sensors and Actuators_Handouts.pptx
Accra-Kumasi Expressway - Prefeasibility Report Volume 1 of 7.11.2018.pdf
Software Engineering and software moduleing
SMART SIGNAL TIMING FOR URBAN INTERSECTIONS USING REAL-TIME VEHICLE DETECTI...
Fundamentals of safety and accident prevention -final (1).pptx
BIO-INSPIRED ARCHITECTURE FOR PARSIMONIOUS CONVERSATIONAL INTELLIGENCE : THE ...
Soil Improvement Techniques Note - Rabbi
Influence of Green Infrastructure on Residents’ Endorsement of the New Ecolog...
AUTOMOTIVE ENGINE MANAGEMENT (MECHATRONICS).pptx
Total quality management ppt for engineering students
Artificial Intelligence
22EC502-MICROCONTROLLER AND INTERFACING-8051 MICROCONTROLLER.pdf
August 2025 - Top 10 Read Articles in Network Security & Its Applications
COURSE DESCRIPTOR OF SURVEYING R24 SYLLABUS
A SYSTEMATIC REVIEW OF APPLICATIONS IN FRAUD DETECTION
Artificial Superintelligence (ASI) Alliance Vision Paper.pdf
Nature of X-rays, X- Ray Equipment, Fluoroscopy
introduction to high performance computing

Presentation data collection and gtfs

  • 1. Introduction to Data Collection & General Transit Feed Specification University of Thessaly,2015 GreenYourMove Project With the contribution of the LIFE programme of the European Union - LIFE14 ENV/GR/000611
  • 3. How are GTFS produced? 3
  • 4. Contents of this presentation  Definitions and Terminology of:  Stop, Routes, Trips, Layers/Shapefiles, CRS, layer features, csv files  OpenStreetMap  General Transit Feed Specification(GTFS)  QuantumGIS(QGIS) & OpenLayers Plugin  PostgreSQL, PostGIS & Shapefile and DBF Loader Exporter  shp2GTFS  Basic usage of:  QGIS  PostgreSQL & PostGIS  shp2GTFS  Conclusion and final results 4
  • 5. Definitions and Terminology  Stop: An object that is defined by a geographic point on Earth and a name/ID, and is usually served/contained in public transit route.  Route: A sequence of stops that are served by a specific vehicle one after another. Information in a route are geospatial objects with IDs and are time-independent .  Trips: Trips are time-dependent executions of routes and they are grouped according to which route they belong to.  Layer/Shapefile: These are two concepts that are interconnected. A shapefile is a data format for GIS systems, which describes vector features, such as points, lines and polygons. When they are used in QGIS they are represented as layers on the map.  CRS: Coordinate Reference System is a system used to locate geographical entities. We will mostly use WGS84 and WGS84/Pseudo Mercator.  QGIS: In QGIS we will create layers, which will contain points and lines and fall under the generic category of layer features.  CSV file: Means Comma-Seperated Values file and is a text file in which every line represents a data record. Each record consists of one or more fields, seperated by commas. 5
  • 6. Definitions and Terminology OpenStreetMap – website https://www.openstreetmap.org/ Definition: As stated on their website: “OpenStreetMap is a free geographic data. OpenStreetMap built by volunteers largely from scratch and released with an open-content license.” Why do we need it? It provides us with datasets about the road network. In most of the cases the raw data from OpenStreetMap needs to be edited in some way in order to use in an application. 6
  • 7. Definitions and Terminology General Transit Feed Specification(GTFS) – website https://developers.google.com/transit/gtfs/ Definition: As stated on their website: “The General Transit Feed Specification (GTFS) defines a common format for public transportation schedules and associated geographic information.” Why do we need it? Because it will be the data format we will use to represent the transit network. It is the mainstream approach when it comes to schedule-based datasets. Its development is supported by big organizations and the number of data available in this form is really bigger than in any other form. 7
  • 8. Definitions and Terminology QuantumGIS(QGIS) – website http://www.qgis.org/ Definition: It is an open-source Geographic Information System(GIS) that is used for viewing, editing and analyzing geographic data. Why do we need it? We use QGIS for data collection. By exploiting the schedule-based nature of a transit network, we create layers which contain all the geographical information of a route. Taking for granted that we somehow know the longitude, latitude and id of a station in a route, we can create a layer containing stops of a route. Next, we use QGIS to create another layer with the data of the actual path the vehicle follows. The output file format are shapefiles. When we open shapefiles with a GIS system they are represented as layers. 8
  • 9. Definitions and Terminology PostgreSQL, PostGIS & Shapefile and DBF Loader Exporter - http://www.postgresql.org/ , http://postgis.net/ Definition: PostgreSQL is an open-source, object-relational database system, while PostGIS is an extension for that database which adds support for geographic objects and allows to run queries for geographic related content in SQL. DBF Loader Exporter is a simple tool for loading files into the database. Why do we need them? It is where we store our data. Furthermore, it allows to run queries to analyze and edit our data. In combination with Python’s modules they are the two main tools that we used in our script shp2GTFS. 9
  • 10. Definitions and Terminology shp2GTFS – not available on website yet. Definition: It is a Python-based script we created that enables us to transform shapefiles to a GTFS feed with less effort than any suggested approach that we are aware of. Why do we need it? Because we need to convert the data we gathered with QGIS to GTFS feeds. There were no tools available online that would enable us to combine the simplicity of gathering data with QGIS and transform the produced layers/shapefiles to a GTFS feed. 10
  • 11. Basic Usage - QGIS 11 The interface: Basic Toolbar, contains most of the tools we need and will use In this box we can see the layers have opened on our canvas. Lon,Lat & CRS information Canvas
  • 12. Basic Usage - QGIS From the basic toolbar we will need: 12 Add feature (Point or Line) Used to open a map layer(plugin) Select and deselect a feature
  • 13. Basic Usage - QGIS About the Plugin and the creation of a layer 13
  • 14. Basic Usage - QGIS Note that:  When we create and save a shapefile to a folder in our computer, there are several files created with it. We only care about the shapefile, but we want to keep the other files too, because they contain information about CRS, encoding, etc. Opening and editing the shapefile only is enough and other files change accordingly.  We are always have to open and work on a OpenStreetMap map layer and not use other services like Bing Maps or Google Maps. 14
  • 15. Basic Usage – PostgreSQL - PostGIS We use the software through a graphic user interface called pgAdmin 3. 15 pgAdmin’s icon:
  • 16. Basic Usage – PostgreSQL - PostGIS 16 In order to load files onto the Database we use the a plugin: After entering username, password and database name we can connect and load shapefiles onto the database.
  • 17. Basic Usage – PostgreSQL - PostGIS *Note that: When installing the database software we enter username and password. We used same username and password for all of the PCs we used. Also before using the loader software you should add to “Path” System Variable the path to the location of the “bin” folder in Postgres’s installation folder. (The path on my computer is: C:Program FilesPostgreSQL9.3bin ) 17
  • 18. Basic Usage – shp2GTFS shp2GTFS is a tool we created and is still in development phase. Its usage is based around the keyfiles. Keyfiles are csv files that contain information about the timetable of the trips and other logic that needs to be included in a GTFS feed. Actually, it connects to the database using a python module, retrieves the data we ask it to retrieve, and combining them with the information on the keyfiles it produces the GTFS files one by one. 18
  • 19. Basic Usage – shp2GTFS *Note that: Using shp2GTFS is achievable through Windows Command Prompt and Cygwin. Before using shp2GTFS you should set System Variable “client_encoding” to “UTF-8”. All input and output paths in the script need to be changed before using it on another computer or you need to have the exact same folder setup on all computers. 19
  • 20. Thank you for your attention! For more information on our project, visit: http://www.greenyourmove.org/ Or contact us at: dimrizopoulos@gmail.com saharidis@gmail.com info@greenyourmove.eu 20