Which Database Design is more effective in this scenario?

debugcn 投稿 Dev

Kiti

DB design 1: There is 1 table

     Create Table (id int primary key, name varchar(20), description varchar(10000));

DB design 2: There are 2 tables

       Create Table1 (id int primary key, name varchar(20));
       Create Table2 (id int primary key, description varchar(10000));

Note: each id must have a description associated with it. We don't query the description so often like name.

In the design 1, 1 simple query can get name & description, no need join but what if we have 1 million records, then will it be slow?

In the design 2, we need join so the database needs some searching & matching id --> this could be slow, but we don't query description often so it will be slow for sometime not all time.

So what is the better design in this scenario?

Branko Dimitrijevic

That's called vertical partitioning or "row splitting" and is no silver bullet (nothing is). You are not getting "better performance" you are just getting "different performance". Whether one set of performance characteristics is better than the other is a matter of engineering tradeoff and varies from one case to another.

In your case, 1 million rows will fit comfortably into DBMS cache on today's hardware, producing excellent performance. So unless some of the other reasons apply, keep it simple, in a single table.

And if its 1 billion rows (or 1 trillion or whatever number is too large for the memory standards of the day), keep in mind that if you have indexed your data correctly, the performance will remain excellent long after it became bigger than the cache.

Only in the most extreme of cases will you need to vertically partition the table for performance reasons - in which case you'll have to measure in your own environment with your own access patterns, and determine if it brings any performance benefit at all; and is it large enough to make up for the increased JOINing.

この記事はインターネットから収集されたものであり、転載の際にはソースを示してください。

侵害の場合は、連絡してください[email protected]

編集2021-06-23

コメントを追加

サインイン

分類Dev

Related 関連記事

記事

Which Database Design is more effective in this scenario?

Which Database Design is more effective in this scenario?

implicit dynamic linking vs explicit dynamic linking - which is more effective?

class/interface design approach for given scenario

How to make more readable and shorter a karate scenario

More effective way to filter data in R

Can I design any lock-free solution for this scenario

Is it correct to have more than one Then in a single Cucumber scenario?

In which scenario we should consider creating manual VPC(and subnets) for KOPS?

Right way to design database

Mongoose/ MongoDB Database Design

Best way to create a multi relational database in my scenario

less or more - which to use when?

Database design for like/love relations

Database design User Group Orders

How to design a database for medical products

A Real World Example of Vector vs List showing scenario where each is more efficient than the other

Which design pattern to use on Java workflow

How to compare two NSDates: Which is more recent?

index vs iterator - which would be more efficient?

Which technique is more efficient for replacing records

What's the effective way to insert more a million rows into postgresql server from another postgres server using Java?

Any scenario in which String.isEmpty() returns true and String.isBlank() returns false for the same input?

Domain driven design - database transaction management

User-Role-Permission based database design

Is a repository only limited to the database in domain driven design?

Best way to manage multiple category in database design

How to design database in order to store "default" items

Database Design: Circular reference and how to correct it

User with multiple roles and multiple teams database design

MySQL Null or empty fields - New Database Design