What are the main characteristics of NoSQL systems in the areas related to distributed systems and distributed databases?

 CHARACTERISTICS RELATED TO DISTRIBUTED DATABASES AND DISTRIBUTED SYSTEMS



 1.  Scalability: 

  • Horizontal scalability: adding more nodes for data storage and processing as the volume of data grows.
  • Vertical scalability: expanding the storage and computing power of existing nodes.
  • NoSQL systems generally use horizontal scalability, and nodes are added while the system is operational, so techniques for distributing the existing data among new nodes without interrupting system operation are necessary.
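
One way to redistribute data onto a new node without stopping the system is consistent hashing. The text above does not name a specific technique, so this is only an illustrative sketch under that assumption; the `HashRing` class, the node names, and the `vnodes` parameter are all hypothetical.

```python
import bisect
import hashlib


def _hash(key: str) -> int:
    """Map a string to a stable integer position on the ring."""
    return int(hashlib.sha256(key.encode("utf-8")).hexdigest(), 16)


class HashRing:
    """Toy consistent-hash ring; each node owns several virtual points."""

    def __init__(self, nodes=(), vnodes=64):
        self.vnodes = vnodes
        self._points = []   # sorted ring positions
        self._owner = {}    # ring position -> node name
        for node in nodes:
            self.add_node(node)

    def add_node(self, node: str) -> None:
        # Place `vnodes` points for this node on the ring.
        for i in range(self.vnodes):
            point = _hash(f"{node}#{i}")
            self._owner[point] = node
            bisect.insort(self._points, point)

    def node_for(self, key: str) -> str:
        # First ring point clockwise from the key's hash, wrapping around.
        idx = bisect.bisect_right(self._points, _hash(key)) % len(self._points)
        return self._owner[self._points[idx]]


keys = [f"user:{i}" for i in range(1000)]
ring = HashRing(["node-a", "node-b", "node-c"])
before = {k: ring.node_for(k) for k in keys}

ring.add_node("node-d")   # scale out; existing nodes keep serving their keys
after = {k: ring.node_for(k) for k in keys}

moved = [k for k in keys if before[k] != after[k]]
print(f"{len(moved)} of {len(keys)} keys moved")
print("all moved keys went to node-d:", all(after[k] == "node-d" for k in moved))
```

Only the keys that now belong to the new node change owner; everything else stays where it was, which is what makes this kind of scheme suitable for adding nodes to a live system.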

 2. Availability, Replication, and Eventual Consistency: 

  • Data is replicated over two or more nodes in a transparent manner. 
  • An update must be applied to every copy of a replicated data item, which can slow down writes if strict consistency is required.
  • Eventual consistency: a consistency model used in distributed computing to achieve high availability. It informally guarantees that, if no new updates are made to a given data item, all accesses to that item will eventually return the last updated value.
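
The eventual-consistency guarantee can be illustrated with a small simulation. This is a minimal sketch, not how any particular NoSQL system works: the `Replica` and `Cluster` classes are hypothetical, and a single global version counter stands in for whatever ordering mechanism a real system would use.

```python
class Replica:
    def __init__(self, name):
        self.name = name
        self.data = {}      # key -> (version, value)

    def apply(self, key, version, value):
        # Keep only the newest version, so late or duplicate messages are harmless.
        current = self.data.get(key)
        if current is None or version > current[0]:
            self.data[key] = (version, value)

    def read(self, key):
        entry = self.data.get(key)
        return entry[1] if entry else None


class Cluster:
    def __init__(self, names):
        self.replicas = [Replica(n) for n in names]
        self.pending = []   # (replica, key, version, value) not yet delivered
        self.version = 0    # assumption: one global counter orders all writes

    def write(self, key, value, at=0):
        # Apply locally right away; other replicas hear about it later.
        self.version += 1
        self.replicas[at].apply(key, self.version, value)
        for i, replica in enumerate(self.replicas):
            if i != at:
                self.pending.append((replica, key, self.version, value))

    def deliver_all(self):
        # Deliver queued updates (newest first here, to show ordering does not matter).
        while self.pending:
            replica, key, version, value = self.pending.pop()
            replica.apply(key, version, value)


cluster = Cluster(["r0", "r1", "r2"])
cluster.write("x", "v1", at=0)
cluster.write("x", "v2", at=1)

stale = [r.read("x") for r in cluster.replicas]
print(stale)        # ['v1', 'v2', None]  (replicas disagree for a while)

cluster.deliver_all()   # no new updates; pending ones finish propagating
converged = [r.read("x") for r in cluster.replicas]
print(converged)    # ['v2', 'v2', 'v2']
```

Reads stay available at every replica throughout, at the cost of possibly returning an old value until propagation completes.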

 3. Replication Models: 

  • Master-slave replication: requires one copy to be the master copy.
  • All write operations must be applied to the master copy and are then propagated to the slave copies, usually using eventual consistency.
  • For reads, either all reads go to the master copy, or reads are also allowed at the slave copies, in which case they are not guaranteed to return the latest writes.
  • Master-master replication: allows reads and writes at any of the replicas. 
  • Because different users may write the same data item concurrently at different nodes, the values of an item may be temporarily inconsistent.
  • A reconciliation method to resolve conflicting write operations on the same data item at different nodes must be implemented as part of the master-master replication scheme.
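
The two replication models can be contrasted in a short sketch. Everything here is hypothetical and simplified: the class names are invented for illustration, timestamps are supplied by the caller, and last-writer-wins is just one possible reconciliation method (the text above does not prescribe one).

```python
class MasterSlave:
    """One master takes all writes; slaves catch up when sync() runs."""

    def __init__(self, n_slaves=2):
        self.master = {}
        self.slaves = [{} for _ in range(n_slaves)]

    def write(self, key, value):
        self.master[key] = value            # writes go to the master only

    def sync(self):
        for slave in self.slaves:           # propagate master state to slaves
            slave.update(self.master)

    def read(self, key, slave=None):
        # slave=None reads from the master; otherwise from the chosen slave.
        source = self.master if slave is None else self.slaves[slave]
        return source.get(key)


class MasterMaster:
    """Every replica takes writes; conflicts are reconciled last-writer-wins."""

    def __init__(self, n=2):
        self.replicas = [{} for _ in range(n)]   # key -> ((timestamp, node), value)

    def write(self, node, key, value, timestamp):
        # Assumption: the caller supplies a timestamp; node id breaks ties.
        self.replicas[node][key] = ((timestamp, node), value)

    def read(self, node, key):
        entry = self.replicas[node].get(key)
        return entry[1] if entry else None

    def reconcile(self):
        # Pick the entry with the largest (timestamp, node) stamp for each key.
        merged = {}
        for replica in self.replicas:
            for key, (stamp, value) in replica.items():
                if key not in merged or stamp > merged[key][0]:
                    merged[key] = (stamp, value)
        for replica in self.replicas:
            replica.update(merged)


ms = MasterSlave()
ms.write("cart", "book")
ms_master_read = ms.read("cart")             # latest value
ms_slave_before = ms.read("cart", slave=0)   # slave has not caught up yet
ms.sync()
ms_slave_after = ms.read("cart", slave=0)
print(ms_master_read, ms_slave_before, ms_slave_after)   # book None book

mm = MasterMaster()
mm.write(0, "cart", "book", timestamp=1)
mm.write(1, "cart", "pen", timestamp=2)      # concurrent write at another node
mm_before = [mm.read(0, "cart"), mm.read(1, "cart")]
mm.reconcile()
mm_after = [mm.read(0, "cart"), mm.read(1, "cart")]
print(mm_before, mm_after)                   # ['book', 'pen'] ['pen', 'pen']
```

Last-writer-wins is simple but silently discards the losing write; schemes that merge or keep both versions avoid that at the cost of more complexity.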
