for the beginners and laterals . . .: August 2021

Properties	Structured data	Semi-structured data	Unstructured data
Basic	Data whose elements are addressable for effective analysis and organized into a formatted tables, schemas or repository that is typically a database.	Data is information that does not reside in a relational database but that have some organizational properties that make it easier to analyze. With some process, you can store them in the relation database.	Data is a data which is not organized in a predefined manner or does not have a predefined data model; thus it is not a good fit for a mainstream relational database. There are some alternative platforms for storing and managing,
Databases	RDBMS like Oracle, MySQL, PostgreSQL. Commonly data stored in data warehouses.	Non RDBMS / NoSQL databases like Mongo DB, Dynamo DB, Riak, Redis, etc. Follows Hadoop Methodology Commonly data stored in data lakes and data marts.	NoSQL databases like Mongo DB, Cassandra HBase, CouchDB, Dynamo DB, Riak, Redis, etc. Store’s character and binary data such as pictures, audio, video, pdf, log files, satellite images, scientific images, radar data, etc Commonly data stored in data lakes and data marts.
Scalability	Very difficult to scale DB schema. Can apply horizonal and vertical scaling	scaling is simpler than structured data	more scalable.
Transactions	Matured transaction and various concurrency techniques supports ACID	Transaction is adapted from DBMS not matured	No transaction management and no concurrency
Flexibility	It is schema dependent and less flexible Having predefined format of data Schema on write	It is more flexible than structured data but less flexible than unstructured data. Variety of data in shapes and sizes. Schema on read	It is more flexible and there is absence of schema. Variety of data in shapes and sizes. Schema on read
Query performance	Structured query allow complex joining	Queries over anonymous nodes are possible	Only textual queries are possible
Version management	Versioning over tuples,row,tables	Versioning over tuples or graph is possible	Versioned as a whole
Robustness	Very robust	New technology, not very spread	New technology, not very spread

for the beginners and laterals . . .

Tuesday, August 3, 2021

Difference between Structured, Semi-Structured and Un Structured data