BigCouch Integration With CouchDB Brings Clustering And Improved Database Compaction To CouchDB

On Monday, Database as a Service vendor Cloudant announced plans to integrate BigCouch into the Apache CouchDB project. BigCouch is Cloudant’s open source fork of CouchDB, designed to support large-scale, distributed applications. The integration will give CouchDB enhanced scalability and performance, a move that is likely to accelerate adoption of the NoSQL CouchDB platform. In conjunction with the decision, Cloudant announced that it will cease development of the BigCouch platform, which was inspired by Amazon’s famous Dynamo research paper.

CouchDB will benefit principally from the clustering functionality that became one of the trademarks of BigCouch. Unlike CouchDB, BigCouch places its nodes in elastic clusters that use consistent hashing, quorum rules for read and write operations, and parallel indexing on data partitions. The graphic below illustrates this with a three-node BigCouch development cluster, in contrast to the unified CouchDB configuration at the top of the picture:

Graphic source: Cloudant’s BigCouch is open-source

Because data partitions are indexed in parallel across the cluster, the BigCouch configuration indexes significantly faster than a single database indexed serially. CouchDB will also benefit from BigCouch’s database compaction functionality, replication speed and high-concurrency access performance.
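
For readers unfamiliar with the consistent hashing and quorum rules mentioned above, the following is a minimal Python sketch of the general Dynamo-style technique. It is not BigCouch’s actual implementation: the class `HashRing`, the helpers `quorum_met` and `overlapping_quorums`, the node names and the choice of MD5 as the ring hash are all assumptions made for illustration.

```python
import bisect
import hashlib


def _hash(key: str) -> int:
    """Map a string to a point on the ring (MD5 is an illustrative choice)."""
    return int(hashlib.md5(key.encode("utf-8")).hexdigest(), 16)


class HashRing:
    """Hypothetical consistent-hash ring; not taken from BigCouch."""

    def __init__(self, nodes, vnodes=64):
        if not nodes:
            raise ValueError("at least one node is required")
        # Each node owns several points ("virtual nodes") on the ring so that
        # keys spread evenly and only a fraction move when a node joins.
        self._ring = sorted(
            (_hash(f"{node}#{i}"), node) for node in nodes for i in range(vnodes)
        )
        self._points = [point for point, _ in self._ring]
        self._node_count = len(set(nodes))

    def preference_list(self, key, n):
        """Return up to n distinct nodes that should hold copies of key."""
        n = min(n, self._node_count)
        # Start at the first ring point after the key's hash, wrapping around.
        i = bisect.bisect(self._points, _hash(key)) % len(self._ring)
        owners = []
        while len(owners) < n:
            node = self._ring[i % len(self._ring)][1]
            if node not in owners:
                owners.append(node)
            i += 1
        return owners


def quorum_met(acks, required):
    """A read or write succeeds once enough replicas have answered."""
    return acks >= required


def overlapping_quorums(n, r, w):
    """With r + w > n, every read quorum shares a replica with every write quorum."""
    return r + w > n


if __name__ == "__main__":
    ring = HashRing(["node1", "node2", "node3"])
    print(ring.preference_list("doc-42", 2))  # two distinct nodes for this key
    print(quorum_met(acks=2, required=2))
    print(overlapping_quorums(n=3, r=2, w=2))
```

In this sketch a document ID is hashed to a point on the ring and stored on the next distinct nodes found clockwise from that point. A write that needs two acknowledgements out of three copies can still succeed while one node is down, which is the kind of fault tolerance a quorum scheme is meant to provide.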

Adam Kocoloski, co-founder and CTO at Cloudant, remarked on the merging of BigCouch with CouchDB as follows:

There are a lot of reasons people love CouchDB, like its elegant programming model, data durability, flexible indexing, and, most of all, its unique way of replicating and synching data across data centers or devices. We’re merging the horizontal scaling and fault-tolerance framework we built for BigCouch into CouchDB so people can more easily scale all that CouchDB goodness across multiple servers and keep it running nonstop. It’s our way of saying thanks and helping to grow the community of CouchDB developers and users.

Interested users can access a preview of the merged CouchDB and BigCouch code now, although the generally available version will follow the release cycle of the Apache Software Foundation’s code release process. The integration of these two open source platforms represents a significant boost to the NoSQL community as options in the NoSQL space continue to proliferate and deepen in functionality, as exemplified by Garantia’s recent acquisition of MyRedis.

Cloudant Names Derek Schoettle As CEO

Cloudant named Derek Schoettle as its new CEO. Prior to Cloudant, Schoettle was VP of Sales at Vertica, which was acquired by HP. Cloudant provides a data management platform for the analysis of multi-petabyte “Big Data” sets. Its BigCouch data management platform leverages the open source, NoSQL Apache CouchDB technology and is offered either as a “database as a service” through a public cloud such as Amazon EC2 or Rackspace, or as a licensed offering for a private cloud. Cloudant made headlines in October when it reached a deal with agriculture company Monsanto to target genetic pathways that result in increased yield and tolerance of stress in corn, soy and other crops. Cloudant plans to house Monsanto’s growing body of data on its Big Data platform and run analytics on it in order to accelerate genomic sequencing analysis of crops. CouchDB, Cloudant’s underlying technology, is a NoSQL data storage platform commercially distributed by Couchbase in addition to Cloudant. Late last week, Cloudant announced that it raised $2.1 million in equity funding, according to a filing with the Securities and Exchange Commission.