Follow @DBDebunk
Follow @ThePostWest
A serious problem in the database field is not just that many data professionals do not know and understand the RDM, but also that they believe they do and criticize it. It is relatively easy to detect such critics. In "Recognizing and Treating Tableitis" Gordon Everest tries to be humorous, but it ends up more sad than funny.
Sunday, January 17, 2016
Sunday, January 10, 2016
This Week
Follow @DBDebunk
Follow @ThePostWest
1. Quote of the Week
1. Quote of the Week
Table (n.) – a collection of information (data?) describing a population of entities which possess some common characteristics, called attributes. Tables are the building block of relational databases. Tables must generally be “normalized,” at least to 1NF. That may be an appropriate way to think of databases when implemented in a modern day DBMS. However, it is not the way the world thinks logically. People have no problem with commonly occurring phenomena such as:
* A multi-valued attribute, e.g., an Employee possesses multiple Skills.
* Many-to-many (M:N) relationships, e.g., as between Employees and Projects
* A relationship with attributes
--Gordon Everest, Recognizing and Treating Tableitis
Monday, January 4, 2016
The Real Data Science Series: 1NF In Theory & Practice
Follow @DBDebunk
Follow @ThePostWest
My January presentation for the San Francisco Microsoft Data Platform User Group:
Wednesday, January 13, 2016 6:30 PM
Microsoft Reactor, 680 Folsom Street , San
Francisco, CA (map)
In the early 70's E. F. Codd provided a very precise, formal
definition of a table in its normal form. Any table not normalized was in
violation of the RDM and not considered a R-table. But you are unlikely to have
encountered that definition. Instead you probably heard about "repeating
groups", "simple domains" and "atomic values", neither
of which are formal relational concepts. C. J. Date provided
a 1NF definition different than Codd's. And you probably think that the same
design principle underlies all normal forms, but 1NF is somewhat distinct.
This presentation introduces order and makes sense of all this,
including the practical implications for SQL database practice. It is first in
THE REAL DATA SCIENCE series (that includes papers and seminars) expounding the
Codd-McGoveran relational model, distinct from Date-Darwen's.You will learn:
• Normalization vs. further normalization
• Repeating Groups
• Simple domains and atomic values
• SQL and 1NF
Sunday, January 3, 2016
NoSQL and SQL: A Plague on Both Their Houses
Follow @DBDebunk
Follow @ThePostWest
Oracle Defends Relational DBs Against NoSQL Competitors prompted Does Oracle Really Understand NoSQL?, which was shared on LinkedIn and triggered a LinkedIn exchange in which I participated. The following comment is an adequate summary of the second article:
Oracle Defends Relational DBs Against NoSQL Competitors prompted Does Oracle Really Understand NoSQL?, which was shared on LinkedIn and triggered a LinkedIn exchange in which I participated. The following comment is an adequate summary of the second article:
JN: It's an unfortunate bit of propaganda. Some truth mixed in with distractions and irrelevant comparison. I've met the DataStax team. They're smart people with a solid understanding of their space. I'm disappointed to see them mix good and bad information into something that looks like objective truth.and the first is not much better. The interested reader can visit all three links. What I want to do here is amplify on some of my LinkedIn comments and add some.
Wednesday, December 30, 2015
Re-interpreting Codd: LOGIC FOR SERIOUS DATABASE FOLKS
Follow @DBDebunk
Follow @ThePostWest
David McGoveran has been working on a book on logic for database professionals. He will post articles online for review that will become chapters in the book. He has just posted first 4 along with a revised "Series Introduction" (link below).
I thought it's worth sharing a short note he sent me on what prompted the book and what some of its objectives are.
David McGoveran has been working on a book on logic for database professionals. He will post articles online for review that will become chapters in the book. He has just posted first 4 along with a revised "Series Introduction" (link below).
I thought it's worth sharing a short note he sent me on what prompted the book and what some of its objectives are.
Wednesday, December 23, 2015
Documents and Databases
Follow @DBDebunk
Follow @ThePostWest
My December post @All Analytics.
Don't let the label fool you. It's the relational data model (RDM), not SQL that NoSQL proponents really are rejecting. The main argument, advanced in a recent LinkedIn exchange, is that lots of information "cannot be represented in rows and columns". The implication is that the RDM is not general enough -- there are certain types of information that it cannot represent. The response from my colleague in RDM David McGoveran, is important enough to restate here.
Read it all. (Comment there, not here, please. Thanks.)
My December post @All Analytics.
Don't let the label fool you. It's the relational data model (RDM), not SQL that NoSQL proponents really are rejecting. The main argument, advanced in a recent LinkedIn exchange, is that lots of information "cannot be represented in rows and columns". The implication is that the RDM is not general enough -- there are certain types of information that it cannot represent. The response from my colleague in RDM David McGoveran, is important enough to restate here.
Read it all. (Comment there, not here, please. Thanks.)
Happy new year!
Sunday, December 20, 2015
Weekly Update
Follow @DBDebunk
Follow @ThePostWest
1. Quote of the Week
1. Quote of the Week
According to Wikipedia, Amazon's Redshift is a modified version of Postgres.
Maybe its speed redshifted data integrity into a bloody mess.
It has no primary keys, foreign keys or unique constraints. It just has optimizer hints in the DDL that *maybe* the data behaves that way. If they want to put hints in the DDL, OK, but don't call those hint PRIMARY KEY, FOREIGN KEY or UNIQUE.
--Unsupported PostgreSQL Features - Amazon Redshift
I don't think any version of Postgres lets you say Create Table Foo (Bar Int PRIMARY KEY) and then let you do
INSERT INTO Foo (BAR) Values (42)That might be fine for a one time only static data warehouse, but an ongoing data update system is going to break the integrity rules, it is just a matter of time. That would make for some surprises when someone decides to migrate their data and lots of application code from a relational DBMS to Redshift.
INSERT INTO Foo (BAR) Values (42)
INSERT INTO Foo (BAR) Values (42)
--Jeff Winchell
Subscribe to:
Posts (Atom)