Comments on: ArangoDB DigitalOcean Cluster: Scalable and Efficient Deployment
https://arangodb.com/2015/04/digital-ocean-cluster/
Wed, 26 Jun 2024 10:12:45 +0000

By: Max Neunhöffer https://arangodb.com/2015/04/digital-ocean-cluster/#comment-584 Tue, 22 Dec 2015 08:59:00 +0000 http://www.arangodb.com/?p=7121#comment-584
In reply to Christian Pekeler.

I have not analysed cost so far.

I did some analysis for https://mesosphere.com/blog/2015/11/30/arangodb-benchmark-dcos/ but there I was essentially only interested in maximizing throughput per vCPU for single-document operations. For that goal, the sweet spot was instances with 8 vCPUs and fast local SSDs, each running a primary DBServer, a secondary DBServer (asynchronous replica) and a coordinator. This was on AWS, but I would expect similar results for DO.

The best way to cut costs is almost certainly to avoid local SSDs, since they make the instances expensive with all providers. However, this will almost immediately cost throughput, simply because the combined I/O performance of the instances cannot keep up. Note that we try to avoid write amplification as much as possible, but there is always some overhead, in particular since we have to write every document once to the write-ahead log of the primary, once to the actual data file, and then the same again for the asynchronous replica. If the combined I/O performance is the bottleneck, then one can often buy instances with less CPU power but the same I/O bandwidth and cut costs that way.
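
As a rough sketch of the arithmetic in the paragraph above: each document is written twice per copy (write-ahead log plus data file), and there is one copy on the primary and one on the asynchronous replica. The function name, parameters, and the example load are hypothetical and chosen only for illustration; the result is a lower bound that ignores any further overhead such as indexes or the filesystem.

```python
def min_write_bandwidth(docs_per_sec, avg_doc_bytes, replicas=1):
    """Lower bound on combined disk write bandwidth in bytes per second.

    Assumes every document is written once to the write-ahead log and once
    to the data file, on the primary and on each asynchronous replica.
    """
    writes_per_copy = 2          # write-ahead log + data file
    copies = 1 + replicas        # primary + replicas
    return docs_per_sec * avg_doc_bytes * writes_per_copy * copies


# Hypothetical load: 10,000 documents per second of 1 KiB each.
print(min_write_bandwidth(10_000, 1024))
```

Comparing this figure with the combined write bandwidth the instances can sustain gives a first indication of whether disk I/O or CPU is likely to be the bottleneck.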

As to reliability, version 3.0 will improve it greatly because of the synchronous replication we are currently implementing.

For a real world application, I would first specify the replication and reliability needs, then specify the needed throughput with some reserves, which essentially will tell you whether or not you need SSDs. Then I would simply compare clusters with different sizes and numbers of droplets with the intended load. This should give you the sweet spot w.r.t. costs for your particular needs.
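
The comparison described above could be organized along these lines. This is only a sketch: the class, the function, the 25% reserve, and all numbers are hypothetical placeholders, and the throughput values would have to come from your own benchmarks with the intended load.

```python
from dataclasses import dataclass


@dataclass
class ClusterResult:
    name: str
    droplets: int
    monthly_cost_per_droplet: float
    measured_ops_per_sec: float  # from your own benchmark run

    @property
    def monthly_cost(self):
        return self.droplets * self.monthly_cost_per_droplet


def cheapest_sufficient(results, required_ops_per_sec, reserve=0.25):
    """Return the cheapest cluster that meets the required throughput
    plus a reserve, or None if no candidate is fast enough."""
    target = required_ops_per_sec * (1 + reserve)
    candidates = [r for r in results if r.measured_ops_per_sec >= target]
    return min(candidates, key=lambda r: r.monthly_cost, default=None)


# Placeholder values only, not measurements.
results = [
    ClusterResult("few large droplets", 3, 80.0, 30_000),
    ClusterResult("many small droplets", 8, 20.0, 20_000),
    ClusterResult("medium droplets", 6, 30.0, 28_000),
]
best = cheapest_sufficient(results, required_ops_per_sec=20_000)
print(best.name if best else "no configuration meets the target")
```

Only configurations that already satisfy the replication and reliability requirements should be entered into such a comparison.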

By: Phil Cockfield https://arangodb.com/2015/04/digital-ocean-cluster/#comment-583 Mon, 21 Dec 2015 23:23:00 +0000 http://www.arangodb.com/?p=7121#comment-583

Can you provide the starting point for this? Where do you run the .sh script from, and where do you export the TOKEN? Is that on an existing droplet that in turn sets up more droplets? Or is this during the setup phase of a new droplet?

Thanks

By: Christian Pekeler https://arangodb.com/2015/04/digital-ocean-cluster/#comment-582 Sat, 11 Apr 2015 03:50:00 +0000 http://www.arangodb.com/?p=7121#comment-582

Nice!
I wonder how a small cluster of large droplets compares to a large cluster of small droplets, in terms of performance, reliability, and cost. There’s probably a sweet spot somewhere.
