Properties | Structured data | Semi-structured data | Unstructured data |
Basic | Data whose elements are addressable for effective analysis, organized into formatted tables, schemas, or a repository that is typically a database. | Data that does not reside in a relational database but has some organizational properties that make it easier to analyze. With some processing, it can be stored in a relational database. | Data that is not organized in a predefined manner or has no predefined data model, so it is not a good fit for a mainstream relational database. Alternative platforms exist for storing and managing it. |
Databases | RDBMS such as Oracle, MySQL, PostgreSQL. Data is commonly stored in data warehouses. | Non-RDBMS / NoSQL databases such as MongoDB, DynamoDB, Riak, Redis, etc. Follows the Hadoop methodology. Data is commonly stored in data lakes and data marts. | NoSQL databases such as MongoDB, Cassandra, HBase, CouchDB, DynamoDB, Riak, Redis, etc. Stores character and binary data such as pictures, audio, video, PDF, log files, satellite images, scientific images, radar data, etc. Data is commonly stored in data lakes and data marts. |
Scalability | Scaling the DB schema is very difficult. Horizontal and vertical scaling can be applied. | Scaling is simpler than for structured data. | More scalable than structured and semi-structured data. |
Transactions | Mature transaction support with various concurrency techniques; supports ACID. | Transactions are adapted from DBMS and are not mature. | No transaction management and no concurrency control. |
Flexibility | Schema dependent and less flexible. Data has a predefined format. Schema on write. | More flexible than structured data but less flexible than unstructured data. Data varies in shape and size. Schema on read. | Most flexible; there is no schema. Data varies in shape and size. Schema on read. |
Query performance | Structured queries allow complex joins. | Queries over anonymous nodes are possible. | Only textual queries are possible. |
Version management | Versioning over tuples, rows, and tables. | Versioning over tuples or graphs is possible. | Versioned as a whole. |
Robustness | Very robust. | Newer technology, not yet widespread. | Newer technology, not yet widespread. |
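To make the Flexibility and Query rows a bit more concrete, here is a small sketch in Python using only the standard library. It is an illustration under assumptions, not a recommendation of any platform from the table: the `employee` table, the sample names, the JSON documents and the log lines are all made up for this example, and SQLite simply stands in for an RDBMS.

```python
import json
import sqlite3

# --- Structured: schema on write (SQLite stands in for an RDBMS) ---
# The table layout below is hypothetical sample data, not from the post.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE employee ("
    "id INTEGER PRIMARY KEY, name TEXT NOT NULL, dept TEXT NOT NULL)"
)
conn.executemany(
    "INSERT INTO employee (id, name, dept) VALUES (?, ?, ?)",
    [(1, "Asha", "DBA"), (2, "Ravi", "Dev")],
)
conn.commit()
structured_names = [
    row[0]
    for row in conn.execute(
        "SELECT name FROM employee WHERE dept = ? ORDER BY id", ("DBA",)
    )
]

# A row that does not fit the predefined format is rejected when written.
try:
    conn.execute("INSERT INTO employee (id, name) VALUES (3, 'Meena')")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

# --- Semi-structured: schema on read (JSON documents) ---
# Each document carries its own fields; the reader decides how to interpret them.
raw_docs = [
    '{"id": 1, "name": "Asha", "dept": "DBA"}',
    '{"id": 2, "name": "Ravi", "skills": ["Hadoop", "Linux"]}',
]
docs = [json.loads(d) for d in raw_docs]
semi_names = [d["name"] for d in docs if d.get("dept") == "DBA"]

# --- Unstructured: no schema, only textual search ---
log_text = (
    "2021-08-03 10:15:02 ERROR disk full on /u01\n"
    "2021-08-03 10:15:09 INFO retrying backup\n"
)
error_lines = [line for line in log_text.splitlines() if "ERROR" in line]

print(structured_names, rejected, semi_names, error_lines)
```

The structured insert fails because the schema is enforced at write time, the JSON documents with different fields load without complaint and are only interpreted at read time, and the log text can only be searched as plain text.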
Tips and good resources for all.. Oracle, Big data, Hadoop, Unix, Linux
Tuesday, August 3, 2021
Difference between Structured, Semi-Structured and Unstructured data