DB Design Question

  • I'm pretty new to DB design but am trying to write a project to help me learn.  I have some general DB design questions and I'm looking for someone w/ some experience in these types of matters to help me out if possible.

    My database is pretty well normalized (or at least I'm pretty sure it is).  I've been writing some queries to do the functionality I want and they join a decent number of tables (8-10).  I'm concerned that eventually performance will drag.  So, some general questions are...

    1) Is 3rd normal form a good goal to shoot for when designing a DB?  Or, is that probably "too normalized" in practice because it's probably "asking for trouble" later on when trying to optimize?

    2) Is there a rule of thumb about having "too many tables in a query?"  Does it indicate a problem if a query that is supposed to do some basic functionality of an application joins like 8-10 tables?  Or is this normal?

    3) When should the decision be made to de-normalize the DB?  Should it be before you write the DB or, in small to medium projects, is it okay (and probably expected!) to do a re-design mid-way?  Obviously the earlier you can get the design right the better but I'm asking more about people's experience in practice.

    4) What are people's thoughts about triggers/constraints for a DB?  Many developers (and that's my background) seem to want to do all the constraining in their application.  Personally, I don't like that approach.  Well, to clarify, I don't think it's enough.  I think you should put in application constraints, but the best place to put them is the DB.  I put in some constraints on my DB, but it's hard (and complicated) to put in (as well as think of) constraints for every possible situation.  I guess some constraints are better than none though, right?

    5) When in the development lifecycle should constraints be put into place?  In my project, I spec'd out my application first.  Then I did my DB design.  Then I wrote my constraints.  Then I wrote my code.  Does that sound like the correct approach, or is it "inadvisable" to write the constraints/triggers too early because a project is likely to change a good amount as it is developed?

    I realize these are general questions but it would be helpful to me to hear what experienced people have to say on these issues.  Thanks in advance for your comments.

  • I will have a go at some of the questions...

    a) You should always go for a 3NF design at the logical design stage.  Until you have a 3NF design, you do not really know if you understand all the data, or even if you have all the necessary data.  If you do not get to a 3NF design at the logical stage, you are very likely to have a very poor application at the physical stage.
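
    To make that concrete, here is a minimal sketch of the kind of transitive dependency 3NF removes (table and column names invented for illustration):

        -- Before (not 3NF): Orders(OrderID, CustomerID, CustomerName).
        -- CustomerName depends on CustomerID rather than on the key OrderID,
        -- so it repeats on every order and can drift out of sync.

        -- After (3NF): the customer attribute moves to its own table.
        CREATE TABLE Customers (
            CustomerID   int          PRIMARY KEY,
            CustomerName varchar(100) NOT NULL
        );
        CREATE TABLE Orders (
            OrderID    int PRIMARY KEY,
            CustomerID int NOT NULL REFERENCES Customers (CustomerID)
        );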

    b) You should de-normalise only when necessary.  Never de-normalise because some people think it is cool.  Only de-normalise when you start to put the physical model together.  Do not even think about de-normalising at the logical stage.

    c) Common reasons to de-normalise are

    i) Large number of tables in frequently-used queries.  Your main queries should join very few tables (2 - 4 maximum).  Your once-a-month queries can join everything in the database if that is needed to get the answer.

    ii) Very wide rows with infrequently used columns.  It can be beneficial to split a wide row into two tables, with frequently used columns in one table and the rest in another (see the sketch after this list).

    iii) DBMS restrictions.  If you have a number of LOBs in a row, it can be best/necessary to place the LOBs into separate tables.

    iv) Reduce the number of dimension tables in a DW.  Sometimes merging multiple dimensions into a single super-dimension can help performance.
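
    For ii) and iii), a rough sketch of the split (names invented): the frequently used columns stay in the main table, and the wide or LOB columns move to a second table sharing the same key, joined 1:1 only when actually needed.

        CREATE TABLE Product (
            ProductID int          PRIMARY KEY,
            Name      varchar(100) NOT NULL,
            Price     money        NOT NULL
        );

        -- Infrequently used wide columns and LOBs split off into their own table
        CREATE TABLE ProductDetail (
            ProductID   int PRIMARY KEY REFERENCES Product (ProductID),
            LongSpec    varchar(max),
            BrochurePdf varbinary(max)
        );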

    d) Common reasons NOT to de-normalise

    i) No performance gain will be achieved.  Unless performance is improved, there is no point in de-normalising.

    ii) Database size will remain small (under 10 GB).  In a small database, it is unlikely that de-normalising a 3NF design will give any noticeable performance gain.  (See i!)

    e) Constraints.  Your approach to constraints seems quite sensible.


  • NORMALISATION

    Normalise when it is practical to do so.  If you have a sales by day of week arrangement then there would be little point in normalising it down to a DAY/SALE table.

    I tend to denormalise when implementing a data warehousing solution, particularly if a normalised query would require a large number of tables.

    NUMBER OF TABLES

    I try to minimize the number of tables/views in a single query.

    The query optimiser has to calculate the best way of joining the tables, and the number of calculations goes up geometrically as the number of tables increases.
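
    To put rough numbers on that: considering left-deep join orders alone there are n! candidate orderings, so 4 tables give 24 possibilities while 10 tables give 3,628,800.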

    CONSTRAINTS IN DATABASE

    Let the database look after the data.  If you put a unique constraint on a table then a programming error in the front end app cannot add duplicate values into your database.
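
    For example (hypothetical table and column):

        -- Enforced by the database itself, no matter which application inserts the row
        ALTER TABLE Customers
            ADD CONSTRAINT UQ_Customers_Email UNIQUE (Email);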

    If the database is a heavy-use database then I limit the constraints to the bare minimum, because there is a processing overhead to enforce them.  For light-use applications I might beef up the constraints in the database to catch bugs that fall through the front end app.

    If multiple applications use the same database then again, I would put more constraints in the database because a bug in one application could corrupt the data for another.

    WHEN TO DESIGN CONSTRAINTS

    Day one.

    In the apps that I develop the data is the foundation and cornerstone of the application.  The storage and protection of the data is fundamental to the application.  I am of the opinion that planning and design should take the lion's share of the project.  The more projects I become involved with (20 years' experience), the more convinced I am that shortcuts taken in planning and design will return to bite you in the bum... HARD.

    Get the foundations right and it makes the rest of the job easier.

  • I echo all the comments already made about normalising etc.

    One thing I would say is that selecting from single tables one at a time can often be far faster than joining the tables, because of the way that the optimizer works.  I learned this lesson when working as an Applications Programmer a few years ago.  I select rows from tables in a 'most effective' fashion - so that the table that returned the fewest rows was hit first, and this then passed the restriction on to the next most restrictive select, and so on.  Because the selects were so restrictive, the performance was excellent.  If you must use joins on tables, make sure that your WHERE clauses are carried out in most restrictive to least restrictive order.  I can explain more about this if anyone wants me to.

    Another trick I learned recently was that often people will do joins and have multi-level subqueries on tables when a UNION will work better.  One of the programmers where I work now gave me a piece of badly performing SQL that joined about 6 tables, each with two levels of subqueries.  It took up to 40 minutes to run.  When I rewrote the query using 'UNION ALL' it took a consistent 3 seconds, even across a Europe-wide network.
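
    A minimal sketch of the shape of that rewrite, with invented names; each restrictive branch becomes its own simple select instead of one query trying to cover every case at once:

        -- Instead of one join/subquery pile-up covering both cases...
        SELECT OrderID, Status FROM Orders WHERE Status = 'OPEN'
        UNION ALL
        SELECT OrderID, Status FROM Orders WHERE Status = 'HELD';
        -- ...each branch is simple enough for the optimizer to drive off an index.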

    The last thing to say about design is that you need to fully understand the system model and the business rules to stand a chance: the system model will determine what data you need to store, and the business rules will determine the constraints under which the model will operate.

  • 1) Is 3rd normal form a good goal to shoot for when designing a DB?  Or, is that probably "too normalized" in practice because it's probably "asking for trouble" later on when trying to optimize?

    I would personally say the goal is all the normal forms that can be reached. I have studied and studied these to make sure I fully understand them (even though they are a bit heady in the way they are written), but the normal forms are about addressing flaws rather than compactness of data (they just tend to lead to that). The key questions are: can you insert the data without some other condition being required, can you delete a row without removing a key piece of info that would then be lost, and can you update the record without a syncing issue? Normalize as fully as possible, then denormalize as needed for performance issues, but look for alternatives first if at all possible.

    2) Is there a rule of thumb about having "too many tables in a query?"  Does it indicate a problem if a query that is supposed to do some basic functionality of an application joins like 8-10 tables?  Or is this normal?

    No rule other than performance concerns. I have joined as many as 17 tables and there was no noticeable delay in the query. Sometimes, though, join slow-downs can be addressed with temporary tables that do some of the gathering, instead of denormalizing.
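
    A sketch of that temporary-table approach (invented names): gather the most restrictive set first, then join the remaining tables to the much smaller intermediate result.

        -- Stage the small, highly filtered set first
        SELECT OrderID, CustomerID
        INTO #RecentOrders
        FROM Orders
        WHERE OrderDate >= '20240101';

        -- Then join the rest of the tables to the temp table
        SELECT r.OrderID, c.CustomerName, l.ProductID
        FROM #RecentOrders r
        JOIN Customers  c ON c.CustomerID = r.CustomerID
        JOIN OrderLines l ON l.OrderID    = r.OrderID;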

    3) When should the decision be made to de-normalize the DB?  Should it be before you write the DB or, in small to medium projects, is it okay (and probably expected!) to do a re-design mid-way?  Obviously the earlier you can get the design right the better but I'm asking more about people's experience in practice.

    See the answer to 1 for part of this. As for redesign, it is very common to make changes throughout the design to improve it. Just make sure you set a list of standards and keep good documentation. You might also consider looking at some of the books on Refactoring and Extreme Programming. Also look here:

    http://www.extremeprogramming.org/

    http://www.martinfowler.com/

    http://www.refactoring.com/

    http://www.instantiations.com/jfactor/files/refback.pdf

    But keep in mind that, although helpful, I am not too sure the extreme points reached are totally needed, such as refactoring an existing stable application for the sake of code reuse. My opinion is that if it isn't broken, don't touch it unless you really have nothing better to do.

    4) What are people's thoughts about triggers/constraints for a DB?  Many developers (and that's my background) seem to want to do all the constraining in their application.  Personally, I don't like that approach.  Well, to clarify, I don't think it's enough.  I think you should put in application constraints, but the best place to put them is the DB.  I put in some constraints on my DB, but it's hard (and complicated) to put in (as well as think of) constraints for every possible situation.  I guess some constraints are better than none though, right?

    Everything you do with regard to the database should be toward the goal of complete data integrity. Constraints, triggers, and business logic in stored procedures are tools you can use to reach that goal as a data safety net. However, putting too much logic in the DB can also be a bigger performance issue than a gain. Weigh your pros and cons, and test when unsure. Generally, though, good constraints and triggers are going to save you lots of potential headaches long term.
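
    For instance, a simple declarative check (hypothetical table) is a cheap piece of that safety net compared with a trigger doing the same job:

        ALTER TABLE Commission
            ADD CONSTRAINT CK_Commission_Rate CHECK (Rate BETWEEN 0.0 AND 1.0);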

    5) When in the development lifecycle should constraints be put into place?  In my project, I spec'd out my application first.  Then I did my DB design.  Then I wrote my constraints.  Then I wrote my code.  Does that sound like the correct approach, or is it "inadvisable" to write the constraints/triggers too early because a project is likely to change a good amount as it is developed?

    As soon as you recognize that a particular object should be finitely controlled, you should have a defined constraint for it. You can do this immediately and always adjust as needed. Odds are you will actually get a better implementation by following a circular design approach. This, however, can be a personal-preference situation, and you may also have to consider the overall complexity of the final product: the more complex it is, the more you may want to build both the front and back ends at the same time, with slow, methodical testing to ensure the flow and the design work hand in hand. Or you may decide the back end is fairly straightforward and build it first. It is similar to the question of where to start eating a sandwich made with loaf bread: some prefer a specific corner, some will start in the middle of a particular side, and some will cut it in half or a wedge and start in the heart of the sandwich. So in essence there is no right or wrong answer.

  • Amen to what has been said; I second everything. I have done a lot of maintenance programming on other people's work, and very little maintenance on databases with good design, even when the client code was bad... The design does not have to be perfect, but if you follow David's advice, there will be less headache in the long run. Remember, database design really is an art and only comes with some experience, because you are not only dealing with the SQL plumbing but with a business model as well, and you need to translate that into your design. A bad business model = a bad database design.

    Get as many of the requirements as possible up front and spend extra time on the database design. Look at all the end reports that will be required and work backwards; look at everything: forms, transactions, business purpose. Do your own analysis by reviewing directly with the end users, and keep going back to your design.

    At first, think normalization; keeping tables normalized to at least third normal form is fine.

    Then, do not be afraid to denormalize, either for performance or for transaction journals.  When you do denormalize, be sure to have a good reason, document it well, and add functions to keep the integrity of the data.

    Constraints, constraints, constraints: it can't be said enough, and don't let your client programmers quietly break them!  Please use foreign key constraints too, NOW!  Use triggers only when constraints will not work. Again, it takes some time invested to know how much and when.
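
    For example (invented names), one statement and the database guarantees every order points at a real customer:

        ALTER TABLE Orders
            ADD CONSTRAINT FK_Orders_Customers
            FOREIGN KEY (CustomerID) REFERENCES Customers (CustomerID);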

    7 years ago I built a system from the ground up that was completely integrated with accounting and most aspects of the company. I spent the extra time up front on a solid database design and used just the suggestions here; looking back, there were 3 years of very little maintenance, maybe twice a year. Then the business changed and I went back to make modifications and add features.  I saw how badly I had written the client code and was surprised how well it had held up for so many years.  I attribute that to the foundation of a good database design!

  • This is a question about constraints in keeping with the topic...

    I am also in the same situation. I am designing a commission system where we import two to three hundred thousand records each month, and they can't be archived (business rules). Each set of imports has different fields, but there is a core set that is common: AgentID, TranDate, etc.

    I have made a "Common" table to hold the common fields, and other specific tables for the specific imports, so I kind of split the imports up: half in Common and half in TransactionA, B, C, etc., based on the import.

    I have constraints on the tables so the same data can't be imported twice and no dupes get in. But the issue I have is that if there is any duplicate value in an import, then none of the data gets imported. How do I get around that? There must be something I am missing...

    Thanks for your thoughts and comments.

  • There is an IGNORE_DUP_KEY option when creating an index, which has to be a unique index.

    If you run an insert statement that adds rows to a table and some of those rows contain duplicates, then SQL Server will give a warning and insert only the non-duplicate rows.

    This then leaves you the problem of dealing with the duplicates.
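
    A sketch of both approaches, using the Common table from the question (the staging table name is invented). IGNORE_DUP_KEY makes the unique index skip duplicate keys with a warning instead of failing the whole statement; the NOT EXISTS filter does the de-duplication during the insert itself and lets you inspect or log what was rejected:

        -- Option 1: the unique index silently discards rows with duplicate keys
        CREATE UNIQUE INDEX UX_Common_AgentTran
            ON Common (AgentID, TranDate)
            WITH (IGNORE_DUP_KEY = ON);

        -- Option 2: filter duplicates out during the import
        INSERT INTO Common (AgentID, TranDate)
        SELECT s.AgentID, s.TranDate
        FROM StagingImport s
        WHERE NOT EXISTS (SELECT 1
                          FROM Common c
                          WHERE c.AgentID  = s.AgentID
                            AND c.TranDate = s.TranDate);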
