Thread: Avoiding duplications in tables
Hi All, I have the following tables. (1) tablename : versions versionnumber - string (2) tablename : applications applicationnumber - integer other details (3) tablename : applicationnumber versionnumber - string applicationnumber - integer (4) tablename : profilemanager versionnumber - string profilename - string applicationnumber - integer other details (5) tablename : profiles versionnumber - string profilename - string Primary key : --------------- (1) versions : versionnumber (2) applicationmanager : versionnumber, applicationnumber (3) applications : applicationnumber (4) profilemanager : veriosnnumber, profilename, applicationnumber (5) profiles : versionnumber, profilename Relations : ----------- (1) versionnumber of 'applicationmanager' referes to versionnumber of 'versions' table. (2) applicationnumber of 'applicationmanager' referes to applicationnumber of 'applications' table. (3) versionnumber and applicationnumber of 'profilemanager' referes to versionnumber and applicationnumber of 'applicationmanager'. (4) versionnumber and profilename of 'profilemanager' referes ot versionnumber and profilename of 'profiles'. (5) versionnumber of 'profiles' referes to versionnumber of 'versions'. In the above design I can able to see duplication of data in all the tables. If I take out a id out of all these tables to make reference, is that problem will solve. Is that is a correct approach. Can any one help me. regards, Deepa K
Hi Deepa, Let me know that 1) Is tables ApplicationNumber and ApplicationManager are same ? 2) Why do you have more than one referential key between two tables, Which will makes process very slow. If answer to the first question as 'yes' and answer to the second question as 'I can change it to single referential key' You can look the following design which will wipe out redundancy. (1) tablename : versions versionnumber - string (2) tablename : applications applicationnumber - integer other details (3) tablename : applicationnumber -- CHANGED ApplicaitonNumberId Integer versionnumber - string applicationnumber - integer (4) tablename : profilemanager -- CHANGED ApplicationNumberId - Integer ProfileId - Integer other details (5) tablename : profiles -- CHANGED ProfileId - Integer versionnumber - string profilename - string Primary key : --------------- (1) versions : versionnumber (2) applicationmanager ( ApplicationNumber) : ApplicationNumberId -- CHANGED (3) applications : applicationnumber (4) profilemanager : ProfileId, ApplicationNumberId -- CHANGED (5) profiles : ProfileId -- CHANGED Relations : ----------- (1) versionnumber of 'applicationmanager/ApplicationNumber' referes to versionnumber of 'versions' table. (2) applicationnumber of 'applicationmanager/ApplicationNumber' referes to applicationnumber of 'applications' table. (3) ApplicationNumberId of 'profilemanager' referes to ApplicationNumberId of 'applicationmanager'. -- CHANGED (4) ProfileId of 'profilemanager' referes to ProfileId of 'profiles'. -- CHANGED (5) versionnumber of 'profiles' referes to versionnumber of 'versions'. I Think this design will serve your purpose (which will move redundancy). Sreedhar Baskararaju ---------------------------(end of broadcast)--------------------------- TIP 6: Have you searched our list archives? http://archives.postgresql.org ----- Original Message ----- From: "Deepa K" <kdeepa@midascomm.com> To: <pgsql-general@postgresql.org> Sent: Monday, March 24, 2003 12:33 PM Subject: [GENERAL] Avoiding duplications in tables > Hi All, > I have the following tables. > > (1) tablename : versions > > versionnumber - string > > (2) tablename : applications > > applicationnumber - integer > other details > > (3) tablename : applicationnumber > > versionnumber - string > applicationnumber - integer > > (4) tablename : profilemanager > > versionnumber - string > profilename - string > applicationnumber - integer > other details > > (5) tablename : profiles > > versionnumber - string > profilename - string > > Primary key : > --------------- > > (1) versions : versionnumber > (2) applicationmanager : versionnumber, applicationnumber > (3) applications : applicationnumber > (4) profilemanager : veriosnnumber, profilename, applicationnumber > (5) profiles : versionnumber, profilename > > Relations : > ----------- > > (1) versionnumber of 'applicationmanager' referes to versionnumber of > > 'versions' table. > (2) applicationnumber of 'applicationmanager' referes to > applicationnumber of 'applications' table. > (3) versionnumber and applicationnumber of 'profilemanager' referes > to versionnumber and applicationnumber of > 'applicationmanager'. > (4) versionnumber and profilename of 'profilemanager' referes ot > versionnumber and profilename of 'profiles'. > (5) versionnumber of 'profiles' referes to versionnumber of > 'versions'. > > In the above design I can able to see duplication of data in all the > tables. If I take out a id out of all these tables to make reference, is > > that problem will solve. Is that is > a correct approach. > > Can any one help me. > > regards, > Deepa K > > > ---------------------------(end of broadcast)--------------------------- > TIP 6: Have you searched our list archives? > > http://archives.postgresql.org
Look for articles on 'surrogate primary keys'. In the system you have below, if you change profilename, or any other string value that's used as a key, your database has to change all of those in all fields. With a surrogate key, that doesn't happen. This reduces database loading, and also the likelyhood of errors. Also,looking things up by strings in searches is slower. If you use a surrogate key, then only ONE search for the string is done, and the remaining searches are done for an integer, MUCH faster. The 'serial' datatype is made for this. it does complicate your SQL statements and require subselects, however. But among open source DB's, Postgres stands out as the one that has supported subsellects and foreign keys the longest and most thoroughly, so you're in luck. Here is the same table that I used for a create table syntax example, which you can now look at for a surrogate key example: CREATE TABLE PhonNums( phone_num_id serial NOT NULL PRIMARY KEY, phon_num varchar(32) NOT NULL UNIQUE, created timestamp DEFAULT CURRENT_TIMESTAMP NOT NULL ); Deepa K wrote: > Hi All, > I have the following tables. > > (1) tablename : versions > > versionnumber - string > > (2) tablename : applications > > applicationnumber - integer > other details > > (3) tablename : applicationnumber > > versionnumber - string > applicationnumber - integer > > (4) tablename : profilemanager > > versionnumber - string > profilename - string > applicationnumber - integer > other details > > (5) tablename : profiles > > versionnumber - string > profilename - string > > Primary key : > --------------- > > (1) versions : versionnumber > (2) applicationmanager : versionnumber, applicationnumber > (3) applications : applicationnumber > (4) profilemanager : veriosnnumber, profilename, applicationnumber > (5) profiles : versionnumber, profilename > > Relations : > ----------- > > (1) versionnumber of 'applicationmanager' referes to versionnumber of > > 'versions' table. > (2) applicationnumber of 'applicationmanager' referes to > applicationnumber of 'applications' table. > (3) versionnumber and applicationnumber of 'profilemanager' referes > to versionnumber and applicationnumber of > 'applicationmanager'. > (4) versionnumber and profilename of 'profilemanager' referes ot > versionnumber and profilename of 'profiles'. > (5) versionnumber of 'profiles' referes to versionnumber of > 'versions'. > > In the above design I can able to see duplication of data in all the > tables. If I take out a id out of all these tables to make reference, is > > that problem will solve. Is that is > a correct approach. > > Can any one help me. > > regards, > Deepa K > > > ---------------------------(end of broadcast)--------------------------- > TIP 6: Have you searched our list archives? > > http://archives.postgresql.org >