SlideShare a Scribd company logo
MongoDB in the context of the Argentinean Census 2010 Mongo France 2011 Victorio J. BENTIVOGLI Villmond Luxembourg
Who we are? Villmond Luxembourg is an Enterprise Content Management (ECM) and Integration services provider established in 2005 Hosted at the Technoport, the first technology-oriented business incubator in Luxembourg, part of the Public Research Center Henri Tudor Delivers strategic Content Management, Collaboration and Integration solutions based on Open Source and proprietary software to some of the most demanding organisations
The Argentinean Census 2010 –  The context Around 43.000.000 inhabitants 200.000.000 images to process 4 months to complete the processing of every single form 15 days to produce  a working mockup  of a QA system
The Argentinean Census 2010 –  Partners
The Argentinean Census 2010 –  The QA system Aimed at controlling the quality of processed booklets Shows images and metadata of scanned booklets Operated 24x7 used by 120 concurrent users
The Argentinean Census 2010 –  The challenge We needed a rapid development cycle because the specifications were a moving target, and …  a really scalable Content Management System that could cope with 200.000.000 documents/images …  not only scalable but fast for insertions with 14 scanners working simultaneously 24x7 and importing around 2.000.000 images daily …  with enterprise class security, content  transformation capabilities and reliable!!!
The Argentinean Census 2010 –  Our proposal For the backend : use  MongoDB  as the underlying database coupled with our  Villmond Content Integration Framework  to complete the solution For the frontend : Develop an  Adobe Flex  based client, encapsulated into an  Adobe AIR  container
MongoDB Document-oriented storage Full Index Support Replication & High Availability Querying Map/Reduce GridFS Commercial Support
Villmond Content Integration Framework The Framework helps organisations to build robust applications that support critical content centric processes. It includes: Support for multiple Content Management platforms, including commercial products like EMC Documentum and Open Source offerings like Alfresco and  MongoDB . This allows reusability, maximising the return on investment (ROI) and avoiding vendor lock-in A ready to use, carefully crafted set of services that supports the entire lifecycle of critical content, shortening development time and improving the overall quality of applications A companion module that facilitates the migration of entire repositories between different platforms
Adobe Flex Front runner in RIA technology, it is cross platform, cross browser. It was conceived as a Domain Specific Language for rich UI development Uses a combination of MXML and ActionScript; and integrates with backend services written in Java or .Net Adobe Flex  SDK is Open Source. Adobe provides a comprehensive set of tools for development (Catalyst, Flash Builder, …) Requires a Adobe Flash Player or Adobe AIR runtime, both available as free downloads Has got wide industry adoption
SV at a glance –  Backend features Web services exposed as REST or Java method calls Distributed sessions and master/slaves configuration using Hazelcast (Slave nodes can be added transparently) Communication using JSON The project is wired using the Spring Framework Enterprise class security managed with Spring Security (formerly ACEGI security) LDAP access for user / roles based authentication The classes that manage the access to  MongoDB  are decoupled and can be replicated (LUNA, LUNB, …)
SV at a glance –  Backend services ECM-Core ECM-Mongo MongoDB LUNB AuthenticationService MongoDB LUNA Filesystem LUNB LDAP Filesystem LUNA SV Core and Service Implementation DataService ImportService UnitManagementService
SV at a glance –  Backend services (cont.) Authentication Service: Provides the connection to LDAP for authentication and authorization credentials Adds the configuration to manage execution of methods depending on the given roles ECM-Core ECM-Mongo AuthenticationService SV Core and Service Implementation DataService ImportService UnitManagementService
SV at a glance –  Backend services (cont.) Data Service: Retrieves imported booklets Retrieves composed images ECM-Core ECM-Mongo AuthenticationService SV Core and Service Implementation DataService ImportService UnitManagementService
SV at a glance –  Backend services (cont.) Import Service: Validates lot importing without altering the database Imports lots of booklets and control units ECM-Core ECM-Mongo AuthenticationService SV Core and Service Implementation DataService ImportService UnitManagementService
SV at a glance –  Backend services (cont.) Unit Management Service: Promotes the control units to the different states Manages purging of rejected and erroneous booklets ECM-Core ECM-Mongo AuthenticationService SV Core and Service Implementation DataService ImportService UnitManagementService
Design issues concerning  MongoDB Booklet metadata is stored in  MongoDB Two  MongoDB  databases for LUNA and LUNB, could be configured as replica sets Booklets and booklet pages are composed records in a collection, allowing to be found fast during a search Deletions and state promotions are performed in background MongoDB  slaves can potentially be accessed concurrently from many Tomcats. The synchronization is accomplished using Hazelcast Booklet importing is executed on the master node/primary  MongoDB  instance
Afterthoughts / lessons learnt MongoDB  and  Adobe Flex  are a great set of tools for rich content applications The data model is essential Content might be stored into the database as well to facilitate enforcement of the appropriate lifecycle The Java driver is great / easy to use Currently, we are using our own mapping mechanism for the DTOs (Data Transfer Object), but we would evaluate Morphia in the future
Thank you !

More Related Content

PPT
Websphere - overview and introduction
PDF
[Meetup] Building an Integration Agile Digital Enterprise
PDF
[Open Source Summit 2019] Microservices with Ballerina
PPTX
ASP.NET 4 & Web Dev in Visual Studio 2010 - Alex Mackey, Readify
PPTX
Web changesandasp4 upload
PPTX
Visual Studio 2010 IDE Enhancements - Alex Mackey, Readify
PDF
Chapter10 web
PPT
Part 2 IBM db2 content manager API training Slides
Websphere - overview and introduction
[Meetup] Building an Integration Agile Digital Enterprise
[Open Source Summit 2019] Microservices with Ballerina
ASP.NET 4 & Web Dev in Visual Studio 2010 - Alex Mackey, Readify
Web changesandasp4 upload
Visual Studio 2010 IDE Enhancements - Alex Mackey, Readify
Chapter10 web
Part 2 IBM db2 content manager API training Slides

What's hot (20)

PPT
IBM db2 content manager API training Slides
PPTX
WebRadar
PDF
Real-Time ETL in Practice with WSO2 Enterprise Integrator
PPT
Introduction To Adobe Flex And Semantic Resources
PDF
Explore the Latest on WSO2 Identity Server 5.11
PDF
[Workshop] API-driven Integration
PDF
Exposing GraphQLs as Managed APIs
PPT
Web 2 0 Requirements
PPTX
OPEN TEXT ADMINISTRATION
PPT
Biz Talk Overview
PDF
Exploring FireDAC
PPTX
Introducing the WSO2 Enterprise Integrator 6.1
PDF
[Webinar with Oceane Consulting] Using Vaadin to Integrate Nuxeo and Liferay
PPTX
Introducing Windows Azure BizTalk Services
PDF
What’s new in WSO2 Enterprise Integrator 6.6
PDF
Function of PHP in Website Development
PPTX
A lap around Windows Azure BizTalk Services - London - September 2013
PDF
A journey from Java EE to cloud-native microservices - Rabobank, JUG meetup
PDF
WSO2 Product Release Webinar: WSO2 Dashboard Server 2.0
PPTX
BizTalk: Server, Services and Apps
IBM db2 content manager API training Slides
WebRadar
Real-Time ETL in Practice with WSO2 Enterprise Integrator
Introduction To Adobe Flex And Semantic Resources
Explore the Latest on WSO2 Identity Server 5.11
[Workshop] API-driven Integration
Exposing GraphQLs as Managed APIs
Web 2 0 Requirements
OPEN TEXT ADMINISTRATION
Biz Talk Overview
Exploring FireDAC
Introducing the WSO2 Enterprise Integrator 6.1
[Webinar with Oceane Consulting] Using Vaadin to Integrate Nuxeo and Liferay
Introducing Windows Azure BizTalk Services
What’s new in WSO2 Enterprise Integrator 6.6
Function of PHP in Website Development
A lap around Windows Azure BizTalk Services - London - September 2013
A journey from Java EE to cloud-native microservices - Rabobank, JUG meetup
WSO2 Product Release Webinar: WSO2 Dashboard Server 2.0
BizTalk: Server, Services and Apps
Ad

Viewers also liked (13)

PPTX
Crunch Your Data in the Cloud with Elastic Map Reduce - Amazon EMR Hadoop
PPT
No It Cloud Computing
PDF
Bil2010 Millicomputing - The Future In Your Pocket
PPTX
Gluecon keynote
PDF
Social Media Assignment- Nissan Micra
PPTX
Gluecon 2013 - NetflixOSS Cloud Native Tutorial Introduction
PDF
SV Forum Platform Architecture SIG - Netflix Open Source Platform
PDF
Capacity Planning for Virtualized Datacenters - Sun Network 2003
PPTX
Architectures for High Availability - QConSF
PDF
Netflix on Cloud - combined slides for Dev and Ops
PDF
Cloud Architecture Tutorial - Why and What (1of 3)
PPTX
Netflix and Open Source
PPTX
AWS Re:Invent - High Availability Architecture at Netflix
Crunch Your Data in the Cloud with Elastic Map Reduce - Amazon EMR Hadoop
No It Cloud Computing
Bil2010 Millicomputing - The Future In Your Pocket
Gluecon keynote
Social Media Assignment- Nissan Micra
Gluecon 2013 - NetflixOSS Cloud Native Tutorial Introduction
SV Forum Platform Architecture SIG - Netflix Open Source Platform
Capacity Planning for Virtualized Datacenters - Sun Network 2003
Architectures for High Availability - QConSF
Netflix on Cloud - combined slides for Dev and Ops
Cloud Architecture Tutorial - Why and What (1of 3)
Netflix and Open Source
AWS Re:Invent - High Availability Architecture at Netflix
Ad

Similar to MongoDB in the context of the Argentinean Census 2010 (20)

PDF
MongoDB@sfr.fr
PPTX
MediaGlu and Mongo DB
PDF
Processing large-scale graphs with Google Pregel
PDF
Mongo bbmw
PPTX
How Government Agencies are Using MongoDB to Build Data as a Service Solutions
PPTX
Branf final bringing mongodb into your organization - mongo db-boston2012
PDF
No SQL Technologies
PPTX
Introducing MongoDB into your Organization
PPTX
Use Case: Apollo Group at Oracle Open World
PPTX
An Evening with MongoDB Detroit 2013
PDF
MongoTorino 2013 Opening Keynote
PDF
Intro To MongoDB
PPT
Welcome and Introduction to A Morning with MongoDB Petah Tikvah
PPTX
An Introduction to Big Data, NoSQL and MongoDB
PDF
Mongo db transcript
PPTX
Welcome to MongoDB Tokyo 2012
PPT
Mongodb open source_high_performance_database
KEY
Why we chose mongodb for guardian.co.uk
PPT
A Morning with MongoDB - Helsinki
PDF
Couch Db
MongoDB@sfr.fr
MediaGlu and Mongo DB
Processing large-scale graphs with Google Pregel
Mongo bbmw
How Government Agencies are Using MongoDB to Build Data as a Service Solutions
Branf final bringing mongodb into your organization - mongo db-boston2012
No SQL Technologies
Introducing MongoDB into your Organization
Use Case: Apollo Group at Oracle Open World
An Evening with MongoDB Detroit 2013
MongoTorino 2013 Opening Keynote
Intro To MongoDB
Welcome and Introduction to A Morning with MongoDB Petah Tikvah
An Introduction to Big Data, NoSQL and MongoDB
Mongo db transcript
Welcome to MongoDB Tokyo 2012
Mongodb open source_high_performance_database
Why we chose mongodb for guardian.co.uk
A Morning with MongoDB - Helsinki
Couch Db

MongoDB in the context of the Argentinean Census 2010

  • 1. MongoDB in the context of the Argentinean Census 2010 Mongo France 2011 Victorio J. BENTIVOGLI Villmond Luxembourg
  • 2. Who we are? Villmond Luxembourg is an Enterprise Content Management (ECM) and Integration services provider established in 2005 Hosted at the Technoport, the first technology-oriented business incubator in Luxembourg, part of the Public Research Center Henri Tudor Delivers strategic Content Management, Collaboration and Integration solutions based on Open Source and proprietary software to some of the most demanding organisations
  • 3. The Argentinean Census 2010 – The context Around 43.000.000 inhabitants 200.000.000 images to process 4 months to complete the processing of every single form 15 days to produce a working mockup of a QA system
  • 4. The Argentinean Census 2010 – Partners
  • 5. The Argentinean Census 2010 – The QA system Aimed at controlling the quality of processed booklets Shows images and metadata of scanned booklets Operated 24x7 used by 120 concurrent users
  • 6. The Argentinean Census 2010 – The challenge We needed a rapid development cycle because the specifications were a moving target, and … a really scalable Content Management System that could cope with 200.000.000 documents/images … not only scalable but fast for insertions with 14 scanners working simultaneously 24x7 and importing around 2.000.000 images daily … with enterprise class security, content transformation capabilities and reliable!!!
  • 7. The Argentinean Census 2010 – Our proposal For the backend : use MongoDB as the underlying database coupled with our Villmond Content Integration Framework to complete the solution For the frontend : Develop an Adobe Flex based client, encapsulated into an Adobe AIR container
  • 8. MongoDB Document-oriented storage Full Index Support Replication & High Availability Querying Map/Reduce GridFS Commercial Support
  • 9. Villmond Content Integration Framework The Framework helps organisations to build robust applications that support critical content centric processes. It includes: Support for multiple Content Management platforms, including commercial products like EMC Documentum and Open Source offerings like Alfresco and MongoDB . This allows reusability, maximising the return on investment (ROI) and avoiding vendor lock-in A ready to use, carefully crafted set of services that supports the entire lifecycle of critical content, shortening development time and improving the overall quality of applications A companion module that facilitates the migration of entire repositories between different platforms
  • 10. Adobe Flex Front runner in RIA technology, it is cross platform, cross browser. It was conceived as a Domain Specific Language for rich UI development Uses a combination of MXML and ActionScript; and integrates with backend services written in Java or .Net Adobe Flex SDK is Open Source. Adobe provides a comprehensive set of tools for development (Catalyst, Flash Builder, …) Requires a Adobe Flash Player or Adobe AIR runtime, both available as free downloads Has got wide industry adoption
  • 11. SV at a glance – Backend features Web services exposed as REST or Java method calls Distributed sessions and master/slaves configuration using Hazelcast (Slave nodes can be added transparently) Communication using JSON The project is wired using the Spring Framework Enterprise class security managed with Spring Security (formerly ACEGI security) LDAP access for user / roles based authentication The classes that manage the access to MongoDB are decoupled and can be replicated (LUNA, LUNB, …)
  • 12. SV at a glance – Backend services ECM-Core ECM-Mongo MongoDB LUNB AuthenticationService MongoDB LUNA Filesystem LUNB LDAP Filesystem LUNA SV Core and Service Implementation DataService ImportService UnitManagementService
  • 13. SV at a glance – Backend services (cont.) Authentication Service: Provides the connection to LDAP for authentication and authorization credentials Adds the configuration to manage execution of methods depending on the given roles ECM-Core ECM-Mongo AuthenticationService SV Core and Service Implementation DataService ImportService UnitManagementService
  • 14. SV at a glance – Backend services (cont.) Data Service: Retrieves imported booklets Retrieves composed images ECM-Core ECM-Mongo AuthenticationService SV Core and Service Implementation DataService ImportService UnitManagementService
  • 15. SV at a glance – Backend services (cont.) Import Service: Validates lot importing without altering the database Imports lots of booklets and control units ECM-Core ECM-Mongo AuthenticationService SV Core and Service Implementation DataService ImportService UnitManagementService
  • 16. SV at a glance – Backend services (cont.) Unit Management Service: Promotes the control units to the different states Manages purging of rejected and erroneous booklets ECM-Core ECM-Mongo AuthenticationService SV Core and Service Implementation DataService ImportService UnitManagementService
  • 17. Design issues concerning MongoDB Booklet metadata is stored in MongoDB Two MongoDB databases for LUNA and LUNB, could be configured as replica sets Booklets and booklet pages are composed records in a collection, allowing to be found fast during a search Deletions and state promotions are performed in background MongoDB slaves can potentially be accessed concurrently from many Tomcats. The synchronization is accomplished using Hazelcast Booklet importing is executed on the master node/primary MongoDB instance
  • 18. Afterthoughts / lessons learnt MongoDB and Adobe Flex are a great set of tools for rich content applications The data model is essential Content might be stored into the database as well to facilitate enforcement of the appropriate lifecycle The Java driver is great / easy to use Currently, we are using our own mapping mechanism for the DTOs (Data Transfer Object), but we would evaluate Morphia in the future

Editor's Notes