Saturday, March 19, 2011

GUID Or Int which is better?

Recently I read a blog on, what is better GUIDs or Integer values. This is a long debate as both have some advantages and disadvantages; my answer to this is it depends!! J

It is dependent on your database design and overall architecture of your project, it’s not that usage of GUID is a bad practice there are some disadvantages of using GUIDs-

1.    Big storage size. At 16 bytes, the uniqueidentifier data type is relatively larger than other data types, such as 4-byte integers. This means indexes that are built using uniqueidentifier keys might be relatively slower than indexes using an Int key.

2.    The values are long and obscure. This makes them difficult for users to type correctly, and more difficult for users to remember in testing.

3.    Don't have the chronology assumption.

4.    Don't benefit from a mechanism to obtain the last generated primary key (MS SQL Server).

5.    There is no way to determine the sequence in which uniqueidentifier values were generated. They are not suited for existing applications that depend on incrementing key values serially.

6.    The values are random and cannot accept any patterns that may make them more meaningful to users.


Advantages of using GUIDs-
  1. If you want to merge tables then Using GUID it’s easy to merge table.
  2. It’s easy to work with distributed tables.
  3. The primary key is uniquely identified in the entire system (the number of machines or tables doesn't matter).
  4. Don't have problems when inserting a large number of operations.

Using INT-

Advantages of using INT-
  1. Small amount of storage size (integer is 4 bytes).
  2. Increased readability, practical use in testing, and easy to remember.
  3. Chronology of data; if two records are in ascending order, we can deduce that the second record was inserted after the first one.
  4. Support for functions that return the last primary key generated (@@IDENTITY, SCOPE_IDENTITY()).
Disadvantages of using INT-
  1. Difficulty in the case of merging tables (the need to make remapping because the primary keys may be duplicated).
  2. Hard to work with distributed tables.
  3. Primary key in the form of INT/BIGINT is LUID, local unique identifier, which is only used locally in a table.
  4. After a large number of operations (insert, delete), the counter for primary key can be reset, bringing the problem of chronology to 1.
Hope someone find this information useful.

No comments:

Post a Comment