MongoUK - Approaching 1 Billion Documents with MongoDB

Approaching 1 Billion
Documents in MongoDB

David Mytton
david@boxedice.com / @davidmytton

Server Density Monitoring

Processing Database UI

www.serverdensity.com

Cache / Data Store
Postback

checksLatest checksHistorical

db.stats()

Documents      937,393,315
Collections    27,566
Indexes        45,277
Stored data    638GB
Inserts        5000-8000/s

As of 17th Jun 2010.

13 months ago

Why we moved: http://bit.ly/mysqltomongo

Initial Setup

Replication

Master      Slave
DC1         DC2
8GB RAM     8GB RAM

Vertical Scaling

Replication

Master      Slave
DC1         DC2
72GB RAM    8GB RAM

Tip #1

Keep your indexes in
memory at all times.

db.stats()
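
One way to act on this tip is to compare the index size reported by db.stats() with the RAM on the server. The sketch below is plain JavaScript, not a mongo shell session: it assumes an object shaped like the db.stats() result, with indexSize in bytes. The function name and the example figures are hypothetical.

```javascript
// Sketch: does the total index size fit in RAM?
// `stats` is assumed to be shaped like the result of db.stats(),
// where indexSize is reported in bytes.
function indexMemoryReport(stats, ramBytes) {
  const GiB = Math.pow(1024, 3);
  return {
    indexGiB: stats.indexSize / GiB,
    ramGiB: ramBytes / GiB,
    fitsInRam: stats.indexSize <= ramBytes,
  };
}

// Hypothetical figures for illustration only (not taken from the talk).
const exampleStats = { indexSize: 48 * Math.pow(1024, 3) };
console.log(indexMemoryReport(exampleStats, 72 * Math.pow(1024, 3)));
```

In the mongo shell, the same function could be given the live result of db.stats() instead of exampleStats.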

Tip #2
Data is flushed to disk every 60s.

db.runCommand({fsync:1});

--syncdelay [60]
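
A rough illustration of why the flush interval matters: at the insert rates quoted on the db.stats() slide, a lot of documents arrive between two flushes. The sketch assumes a constant insert rate and that nothing forces an earlier flush. The function name is hypothetical.

```javascript
// Sketch: documents written between two flushes at a constant insert rate.
function docsBetweenFlushes(insertsPerSecond, syncDelaySeconds) {
  return insertsPerSecond * syncDelaySeconds;
}

// Insert rates from the db.stats() slide, with the default 60s interval.
console.log(docsBetweenFlushes(5000, 60)); // 300000
console.log(docsBetweenFlushes(8000, 60)); // 480000
```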

Sharding solves
everything

Manual Partitioning
Replication

Master A    Slave A
DC1         DC2
16GB RAM    16GB RAM

Replication

Master B    Slave B
DC1         DC2
16GB RAM    16GB RAM

Sustained Traffic

           Master       Slave
Avg out    2.4Mbit/s    4.0Mbit/s
Avg in     3.8Mbit/s    111.2Kbit/s

Database vs collections

• Many databases = many data files (small at first, but they quickly get large).
• Many collections = watch the namespace limit.

Namespaces = number of collections + number of indexes

Tip #3

Monitor the 24,000
namespace limit.
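
The namespace formula can be turned into a simple headroom check. The limit applies to each database, so the counts passed in should be for a single database. The sketch below is plain JavaScript; the function name and the example counts are hypothetical.

```javascript
// Sketch: namespaces used = collections + indexes, compared with the limit.
function namespaceUsage(collections, indexes, limit) {
  const used = collections + indexes;
  return {
    used: used,
    remaining: limit - used,
    percentUsed: (used / limit) * 100,
  };
}

// Hypothetical counts for one database (not taken from the talk).
console.log(namespaceUsage(9000, 12000, 24000));
// { used: 21000, remaining: 3000, percentUsed: 87.5 }
```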

Using Server Density

Console

db.system.namespaces.count()

Replica Pairs = Failover
Replica Pair

Master A    Slave A
DC1         DC2
16GB RAM    16GB RAM

Replica Pair

Master B    Slave B
DC1         DC2
16GB RAM    16GB RAM

Tip #4

Pre-provision your oplog files.

A shell script to generate 75GB oplog files

# Pre-allocate zero-filled files local.0 to local.40,
# each 2146435072 bytes (2047MiB).
for i in {0..40}
do
    echo $i
    head -c 2146435072 /dev/zero > local.$i
done
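
The loop above writes 41 files (indices 0 to 40) of 2,146,435,072 bytes each. To check how a file count relates to a target oplog size, the arithmetic can be sketched as below. The function names are hypothetical, and the example assumes GB is meant as GiB (1024^3 bytes).

```javascript
// Sketch: relate a number of preallocated files to a total oplog size.
const BYTES_PER_FILE = 2146435072; // per-file size used by the script (2047MiB)
const GIB = Math.pow(1024, 3);

// Total bytes written for a given number of files.
function totalPreallocatedBytes(fileCount, bytesPerFile) {
  return fileCount * bytesPerFile;
}

// Smallest number of files that covers a target size.
function filesNeeded(targetBytes, bytesPerFile) {
  return Math.ceil(targetBytes / bytesPerFile);
}

console.log(totalPreallocatedBytes(41, BYTES_PER_FILE)); // 88003837952
console.log((totalPreallocatedBytes(41, BYTES_PER_FILE) / GIB).toFixed(2)); // "81.96"
console.log(filesNeeded(75 * GIB, BYTES_PER_FILE)); // 38
```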

Tip #5

Expect slower performance
during initial replica sync.

Tip #6

You can rotate your log files
from the console.

Rotating your log files

use admin
db.runCommand("logRotate")

Tip #7

Index creation blocks by
default. Use background
indexing if necessary.

MongoDB Manual: http://bit.ly/mongobgindex
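
A minimal sketch of requesting a background build. It assumes a collection object that exposes ensureIndex(keys, options), the mongo shell method of this era (later versions name it createIndex). The helper name, the stand-in collection and the field name are hypothetical, so the example runs without a database.

```javascript
// `collection` is assumed to expose ensureIndex(keys, options).
// Passing { background: true } asks for a non-blocking index build.
function ensureIndexInBackground(collection, keys) {
  return collection.ensureIndex(keys, { background: true });
}

// Stand-in collection that only records the call, so the sketch runs
// without a database. In the mongo shell, pass a real collection instead.
const recordedCalls = [];
const stubCollection = {
  ensureIndex(keys, options) {
    recordedCalls.push({ keys: keys, options: options });
  },
};

// "serverId" is a hypothetical field name.
ensureIndexInBackground(stubCollection, { serverId: 1 });
console.log(recordedCalls[0].options); // { background: true }
```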

Tip #8

Increase your OS file
descriptor limit + use
persistent connections.

Too many open files!

/etc/security/limits.conf

# user   type   item     limit
mongo    hard   nofile   10000
mongo    soft   nofile   10000

/etc/ssh/sshd_config

UsePAM yes

Space is not reused

Data + indexes       551GB
Actual disk usage    638GB

Fixed in
1.1.4 1.3.x 1.5.0 1.5.1 1.5.2 1.5.3 1.5.4?

JIRA: SERVER-366
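
The gap between the two figures above can be worked out as follows. The sketch treats the whole difference as space that is not being reused, which is an assumption; the function name is hypothetical.

```javascript
// Sketch: gap between logical size (data + indexes) and disk usage.
function unreclaimedSpace(dataPlusIndexesGB, diskUsageGB) {
  const overheadGB = diskUsageGB - dataPlusIndexesGB;
  return {
    overheadGB: overheadGB,
    percentOfData: (overheadGB / dataPlusIndexesGB) * 100,
  };
}

// Figures from the slide: 551GB of data + indexes, 638GB on disk.
const gap = unreclaimedSpace(551, 638);
console.log(gap.overheadGB); // 87
console.log(gap.percentOfData.toFixed(1)); // "15.8"
```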

Summary
1. Keep indexes in memory.
2. Data is flushed to disk every 60s.
3. Monitor the 24k namespace limit.
4. Pre-provision oplog files.
5. Expect slower performance on replica sync.
6. Rotate logs from the console.
7. Index creation blocks by default.
8. OS file descriptor limit + persistent connections.

Slides
blog.boxedice.com/mongodb

David Mytton
david@boxedice.com / @davidmytton
