SQL(Structured Query Language)

Structured Query Language (SQL) provides the ability to create and define relational database objects. After these objects are defined, the language permits one to add data to these objects. Once data has been added, one can modify, retrieve, or delete that data. The language provides the capability of defining what type of authority one might have when accessing the data.

 

Data Definition Language

As the name implies, there is a group of SQL statements that allows one to define the relational structures that will manage the data placed in them. The “CREATE” statements brings Relational Database Management System (RDMS) objects into existence. The types of objects one can create are STOGROUP, Database, Table space, Table, Index, View, Synonym, and Alias. The definitions of these objects are as follows:

 

STOGROUP: A storage group is a list of disk volume names to which one can assign a name. One defines the list of disk volumes and assigns the STOGROUP name with the Create STOGROUP statement.

 

Database: A database is a logical structure in which tables and indexes are later created. The database is defined and associated with a STOGROUP with a Create Database statement.

 

Tablespace: A tablespace is an area on disk that is allocated and formatted by the Create Table space statement.

 

Table: A table is an organizational structure which is defined in a Create Table statement. In this statement, the data attributes are defined by column, giving each column its own unique name within the table.

 

Index: A index is used in conjuction with the “Primary Key” parameter of the Create Table statement. It is made with the Create Index statement and provides the duplicate record-checking necessary for a unique key.

 

View: A view is an alternative perspective of the data present in a database. It is made with the Create View statement and can represent a subset of the columns defined in a table. It can also represents a set of columns combined from more than one table.

 

Synonym: The Create Synonym statement defines an unqualified name for a table or a view.

 

Alias: The Create Alias statement defines an alternate qualified name for a table or a view.

 

After a table is created, additional columns may be added with an Alter Table statement. Any RDMS object that was made with a create statement can be removed with a drop statement.

 

In order to define RDMS objects, one needs various levels of authority. The following is a list of authority levels that can be granted to a user ID to operate on a designated database.

 

DBADM                Database administrator authority

DBCTRL              Database control authority

DBMAINT             Database maintenance authority

CREATETS         Create Table space Authority

CREATETAB       Create Table authority

DROP                  Drop authority on a database or subordinate objects

 

Data Manipulation Language

 

                There are four SQL data manipulation statements(DML) available: Insert, Select, Update, and Delete. After tables are defined, they are ready to store data. Data is added to tables through the SQL Insert statement. Once data has been inserted into a table, it can be retrieved by the use of the Select statement. Data stored in a table can be modified by executing the SQL Update statement. Data can be deleted from a table by using the SQL Delete statement.

                The SQL statements perform RDMS operations that can affect only one row at a time if desired. The same statements can, if required, affect many or all of the rows in a table. It is possible to select one row and insert it into another with one statement. It is also just as easy to select all of the rows from one table and insert all of them into another with a single statement. The same scope of operation applied to the update and delete statements. The scope of operation is controlled by the use of the WHERE clause. The operation will affect only the rows that satisfy the search condition. When no search condition specified, the entire table is affected.

 

                There are additional language elements available that provide the ability to process the table data while it is being retrieved. In addition, there are a variety of functions that modify the value of the data that is returned in a query. There are column functions that act on all of the values of the selected rows for a specified column and return a single answer. There are also scalar functions that return a specific answer for each row that satisfies the search condition.

 

                As mentioned previously, SQL provides the ability to filter what data is retrieved in a select statement by including the WHERE clause. The WHERE clause specifies a variety of comparisons between two values. The values could be column values or the result of an operation involving more than one column or a constant. The comparison operation are the same as those used in COBOL, with the exception of two additional operators. The first is the IN operator that compares a single value has a match in the specified list of values. The other is the LIKE operator, in which you can specify a value string that includes “wildcard” characters in such a manner that you can select rows of a table where column values are similar to the extent you require.

 

                SQL provides four arithmetic operations : addition, subtraction, multiplication, and division. An arithmetic _expression may involve any combination of column name or numbers. The arithmetic _expression may itself be used as a column name or in a Select, Insert, Update, or Delete statement.

 

                SQL provides the ability to sort the data retrieved from a table via the ORDER BY clause. In this clause, you can specify one or more sort column names as well as if each sort key is ascending or descending.

 

                SQL also provides the ability to perform set manipulation operations. Using SQL, one can SELECT the intersection of two or more sets of data by coding a JOIN. A JOIN is any SELECT statement that has more than one DBMS object listed in its FROM clause. One can combine different sets of data by using the UNION operator. Other set manipulations can be executed by combining different operators and search conditions.

 

The Following are the most frequently asked questions....

 

  Q1)  What RDMS objects are created with the SQL CREATE statements?

  A1)  The SQL CREATE statements are used to create the following objects:

                STOGROUP         A storage group

                DATABASE           A logical collection of tables

                TABLESPACE      An area that stores tables

                TABLE                   A data structure organized by a specified columns

                INDEX                   An alternate path to a table data

                VIEW                     An alternate representation of one or more tables

                SYNONYM            An alternate name for local table or view

                ALIAS                    An alternate name for a table definition which may be local   or remote, existence or nonexistent

 

  Q2)  What RDMS objects are required before you can create a table?

  A2)  Before you can create a table, you need an existing database and tablespace.

 

  Q3)  In what RDMS object does one first list column names?

  A3)  One first uses the column name in the CREATE TABLE statement.

 

  Q4)  What is the syntax for a CREATE TABLE statement?

  A4)  CREATE TABLE table name

                                (column name list

                                primary key (column name))

                                in database-name, tablespace-name.

 

  Q5)  Can one add columns to a table after it has been defined?

  A5)  Yes, one can add column to a table after it has been defined by using the SQL ALTER TABLE statement.

 

  Q6)  Where in a table are added columns located?

  A6)  The new columns are added to the end of the table.

 

  Q7)  After a table is defined, can columns be removed?

  A7)  The only way to remove columns from an existing table involves a migration program that extracts only the desired  

columns of data, redefining the table without the unwanted columns, then populating the new table. One have to handle

all the old table’s dependents programmatically.

 

  Q8)  Which RDMS objects can you change with the SQL ALTER statements?

  A8)  The SQL ALTER statement can change a table index, a table, a tablespace, or a STOGROUP.  

 

  Q9)  What authority is required to create a table?

  A9)  In order to create tables, one needs CREATETAB privileges.

 

  Q10)    What is minimum authority required for one to create a tablespace?

  A10)     In order to create tablespaces, one needs CREATETS privileges.

 

  Q11)    When is it necessary to create a table index?

  A11)     It is necessary to create a table index whenever you want to enforce the uniqueness of the table’s primary key.

 

  Q12)    What is a synonym?

  A12)     A synonym is an unqualified alternative name for a table or view.

 

  Q13)    What is a foreign key?

  A13)     A foreign key is the key defined in one table to reference the primary key of a reference table. This foreign key must  

have the same structure as the reference table’s primary key.

 

  Q14)    What is referential integrity?

  A14)     Referential integrity is the automatic enforcement of referential constraints that exist between a reference table and a  

referencing table. When referential integrity is enforced, the value of a foreign key exists as  a primary key value in the

reference table. In other words, when referential integrity is enforced, all of the foreign key values in, for example, the 

“department code” column in an “employee” table exist as primary key values in a “department” table.

 

  Q15)    What are the column name qualifiers?

  A15)     A column name qualifier are used as a table designator to avoid ambiguity when the column names referenced exists  

in more than one table used in the SQL statement. Column name qualifiers are also used in correlated references.

 

  Q16)    What is a correlation name?

  A16)     A correlation name is a special type of column designator that connects specific columns in the various levels of a  

multilevel SQL query.

 

  Q17)    What is a results table?

  A17)     A result table is the product of a query against one or more tables or views (i.e., it is the place that holds the results of a  

query).

 

  Q18)    What is a cursor?

  A18)     A cursor is a named control structure used to make a set of rows available to a program. DB2 is the relational database  

system that runs in an MVS environment. It was developed by IBM and interfaces with SQL. With the use of SQL

DB2, databases can be accessed by a wide range of host languages. SQL is the relational database " application

language " that interfaces with DB2. Because of its capabilities, SQL and, in turn, DB2 have gained considerable

acceptance. Thus, a working knowledge of DB2 increases one's marketability.

 

  Q19)    What is the basic difference between a join and a union?  

  A19)           A join selects columns from 2 or more tables. A union selects rows.   

 

  Q20)    What is normalization and what are the five normal forms?  

  A20)     Normalization is a design procedure for representing data in tabular format.  The five normal forms are progressive  

rules to represent the data with minimal redundancy.

  Q21)    What are foreign keys?  

  A21)     These are attributes of one table that have matching values in a primary key in another table, allowing for relationships  

between tables.

 

  Q22)    Describe the elements of the SELECT query syntax?  

  A22)     SELECT element FROM table WHERE conditional statement.  

 

  Q23)    Explain the use of the WHERE clause?  

  A23)     WHERE is used with a relational statement to isolate the object element or row.  

 

  Q24)    What techniques are used to retrieve data from more than one table in a single SQL statement?  

  A24)     Joins, unions and nested selects are used to retrieve data.  

 

  Q25)    What is a view? Why use it?  

  A25)     A view is a virtual table made up of data from base tables and other views, but not stored separately.  

 

  Q26)    Explain an outer join?  

  A26)     An outer join includes rows from tables when there are no matching values in the tables.  

 

  Q27)    What is a subselect?  Is it different from a nested select?  

  A27)     A subselect is a select which works in conjunction with another select. A nested select is a kind of subselect where the  

inner select passes to the where criteria for the outer select.

 

  Q28)    What is the difference between group by and order by?  

  A28)     Group by controls the presentation of the rows, order by controls the presentation of the columns  for the results of the  

SELECT statement.

 

  Q29)    What keyword does an SQL SELECT statement use for a string search?  

  A29)     The LIKE keyword allows for string searches.  The % sign is used as a wildcard.  

 

  Q30)    What are some SQL aggregates and other built-in functions?  

  A30)     The common aggregate, built-in functions are AVG, SUM, MIN, MAX, COUNT and DISTINCT.  

 

  Q31)    How is the SUBSTR keyword used in SQL?  

  A31)     SUBSTR is used for string manipulation with column name, first position and string length used as arguments.  E.g.  

SUBSTR (NAME, 1 3) refers to the first three characters in the column NAME.

 

  Q32)    Explain the EXPLAIN statement?  

  A32)           The explain statement provides information about the optimizer's choice of access path of the SQL.  

 

  Q33)    What is referential integrity?  

  A33)     Referential integrity refers to the consistency that must be maintained between primary and foreign keys, i.e. every  

foreign key value must have a corresponding primary key value. 

 

  Q34)    What is a NULL value?  What are the pros and cons of using NULLS?  

  A34)     A NULL value takes up one byte of storage and indicates that a value is not present as opposed to a space or zero  

value.  It's the DB2 equivalent of TBD on an organizational chart and often correctly portrays a business situation. 

Unfortunately, it requires extra coding for an application program to handle this situation.

 

  Q35)    What is a synonym?  How is it used?  

  A35)     A synonym is used to reference a table or view by another name.  The other name can then be written in the  

application code pointing to test tables in the development stage and to production entities when the code is migrated. 

The synonym is linked to the AUTHID that created it.

 

 

  Q36)    What is an alias and how does it differ from a synonym?  

  A36)     An alias is an alternative to a synonym, designed for a distributed environment to avoid having to use the location  

qualifier of a table or view.  The alias is not dropped when the table is dropped.

 

  Q37)    When can an insert of a new primary key value threaten referential integrity?  

  A37)     Never. New primary key values are not a problem.  However, the values of foreign key inserts must have  

corresponding primary key values in their related tables. And updates of primary key values may require changes in

foreign key values to maintain referential integrity.

 

  Q38)    What is the difference between static and dynamic SQL?  

  A38)     Static SQL is hard-coded in a program when the programmer knows the statements to be executed.  For dynamic SQL  

the program must dynamically allocate memory to receive the query results.

 

  Q39)    Compare a subselect to a join?  

  A39)     Any subselect can be rewritten as a join, but not vice versa. Joins are usually more efficient as join rows can be  

returned immediately, subselects require a temporary work area for inner selects results while processing the outer

select.

 

  Q40)    What is the difference between IN subselects and EXISTS subselect?  

  A40)     If there is an index on the attributes tested an IN is more efficient since DB2 uses the index for the IN.  (IN for index is  

the mnemonic).

 

  Q41)    What is a Cartesian product?  

  A41)     A Cartesian product results from a faulty query.  It is a row in the results for every combination in the join tables.  

 

  Q42)    What is a tuple?  

  A42)     A tuple is an instance of data within a relational database.  

 

  Q43)    What is the difference between static and dynamic SQL?  

  A43)           Static SQL is compiled and optimized prior to its execution; dynamic is compiled and optimized during execution.  

 

 

  Q44)    Any SQL implementation covers data types in couple of main categories. Which of the following are those data  

types ? (Check all that apply)

A). NUMERIC  

B). CHARACTER  

C). DATE AND TIME  

D). BLOBS   E. BIT

  A44)     A,B,C. Not all SQL implementations have a BLOB or a BIT data types.  

 

  Q45)    We have a table with a CHARACTER data type field. We apply a ">" row comparison between this field and  

another CHARACTER field in another table. What will be the results for records with field value of NULL?

(Check one that applies the best)

  A.      TRUE   

  B.       B. FALSE   

  C.       C. UNKNOWN   

  D.      D. Error.  

  E.       E. Those records will be ignored  

  A45)     C. NULL in a row when compared will give an UNKNOWN result.  

 

  Q46)    Any database needs to go through a normalization process to make sure that data is represented only once. This  

            will eliminate problems with creating or destroying data in the database. The normalization process is done

            usually in three steps which results in first, second and third normal forms. Which best describes the process to

            obtain the third normal form? (Check one that applies the best)

  A.      Each table should have related columns.    

  B.       Each separate table should have a primary key.    

  C.       We have a table with multi-valued key. All columns that are dependent on only one or on some of the keys should be moved in a different table.

  D.      If a table has columns not dependent on the primary keys, they need to be moved in a separate table.    

  E.       E. Primary key is always UNIQUE and NOT NULL.  

  A46)     D. All columns in a table should be dependent on the primary key. This will eliminate transitive dependencies in  

which A depends on B, and B depends on C, but we're not sure how C depends on A.

 

  Q47)    SQL can be embedded in a host program that uses a relational database as a persistent data repository. Some of  

the most important pre-defined structures for this mechanism are SQLDA ("SQL Descriptor Area") and

SQLCA ("SQL Communications Area") SQLCA contains two structures - SQLCODE and SQLSTATE.

SQLSTATE is a standard set of error messages and warnings in which the first two characters defines the class

and the last three defines the subclass of the error. Which of the following SQLSTATE codes is interpreted as

"No data returned"?(Check one that applies the best) 

A). 00xxx  

B). 01xxx 

C). 02xxx  

D). 22xxx  

E). 2Axxx

  A47)     C. 00 - is successful completion, 01 - warnings, 22 - is data exception and 2A is syntax error. The SQLSTATE code  

format returned for "No data returned" is "02xxx".

 

  Q48)    What are common SQL abend codes? (e.g. : 0,100 etc.,)

  A48)     -818 time stamp mismatch

-180 wrong data moved into date field

      

  Q49)    What is meant by dynamic SQL?

  A49)     Dynamic SQL are SQL statements that are prepared and executed within a program while the program is executing.  

The SQL source is contained in host variables rather than being hard coded into the program. The SQL statement may

change from execution to execution.

 

  Q50)    What is meant by embedded SQL?

  A50)     They are SQL statements that are embedded with in application program and are prepared during the program

 preparation process before the program is executed. After it is prepared, the statement itself does not change(although

values of host variables specified within the statement might change).

 

  Q51)    What is meant by entity integrity?

  A51)     Entity integrity is when the primary key is in fact unique and not null.

 

  Q52)    What will EXPLAIN do?

  A52)     EXPLAIN obtains information (which indexes are used, whether sorting is necessary, which level of locking is  

applied) about how SQL statements in the DBRM will be executed, inserting this information into the 

“X”.PLAN.TABLE where the “X” is the authorization ID of the owner of the plan.

 

  Q53)    What is the foreign key?  

  A53)     A foreign key is a column (or combination of columns) in a table whose values are required to match those of the  

primary key in some other table.

 

  Q54)    What will GRANT option do?

  A54)     It will grant privileges to a list of one or more users. If the GRANT option is used in conjunction with the “PUBLIC”  

option, then all users will be granted privileges. Also you can grant privileges by objects and types.

 

  Q55)    What does the term “grant privileges” mean?

  A55)     Grant privileges means giving access/authority to   DB2 users.

 

  Q56)    What is an image copy?

  A56)     It is an exact reproduction of all or part of a tablespace. DB2 provides utility programs to make full-image copies (to  

copy the entire tablespace) or incremental image copies to copy only those pages that have been modified since the last

image copy.

 

  Q57)    What is meant by an index?

  A57)     An index is a set of row identifiers (RIDs) or pointers that are logically ordered by the values of a column that has  

been specified as being an index. Indexes provide faster access to data and can enforce uniqueness on the row in a

table.

 

  Q58)    What is an index key?

  A58)     It is a column or set of columns in a table used to determine the order of index entries.

 

  Q59)    What is a join?

  A59)     A join is a relational operation that allows retrieval of data from two or more tables based on matching columns  

values.

 

  Q60)    What is meant by locking?

  A60)     Locking is a process that is used to ensure data integrity. It also prevents concurrent users from accessing inconsistent  

data. The data (row) is locked until a commit is executed to release the updated data.

 

  Q61)    What is meant by null?

  A61)     This is a special value that indicates the absence of data in a column. This value is indicated by a negative value,  

usually -1.

 

  Q62)    What is an object?

  A62)     An object is anything that is managed by DB2 (that is databases, table spaces, tables, views, indexes or synonyms), but  

not the data itself.

 

  Q63)    Describe referential integrity?

  A63)     Referential integrity refers to a feature in DB2 that is used to ensure consistency of the data in the database.

 

  Q64)    Describe a primary key?

  A64)     A primary key is a key that is unique, non-null, and is part of the definition of a table. A table must have a primary key  

to be defined as a parent.

 

  Q65)    How would you find out the total number of rows in a table? - GS

  A65)     Use SELECT COUNT(*) ...

 

  Q66)    How do you eliminate duplicate values in SELECT? - GS

  A66)     Use SELECT DISTINCT ...

 

  Q67)    How do you select a row using indexes? - GS

  A67)     Specify the indexed columns in the WHERE clause.

 

  Q68)    What are aggregate functions?

  A68)     Bulit-in mathematical functions for use in SELECT clause.

 

  Q69)    How do you find the maximum value in a column? - GS

  A69)     Use SELECT MAX(...

 

  Q70)    Can you use MAX on a CHAR column?

  A70)     YES.

 

  Q71)    My SQL statement SELECT AVG(SALARY) FROM EMP-TABLE yields inaccurate results. Why?

  A71)     Because SALARY is not declared to have Null’s and the employees for whom the salary is not known are also  

counted.

 

  Q72)    How do you retrieve the first 5 characters of FIRSTNAME column of EMP table?

  A72)     SELECT SUBSTR(FIRSTNAME,1,5) FROM EMP;

 

  Q73)    How do you concatenate the FIRSTNAME and LASTNAME from EMP table to give a complete name?

  A73)     SELECT FIRSTNAME || ' ' || LASTNAME FROM EMP;  

 

  Q74)    What is the use of VALUE function?

  A74)     Avoid negative SQLCODEs by handling nulls and zeroes in computations.

Substitute a numeric value for any nulls used in computation.

 

  Q75)    What is UNION ,UNION ALL? - GS

  A75)     UNION                       eliminates duplicates

UNION ALL:             retains duplicates

Both these are used to combine the results of different SELECT statements.

 

  Q76)    Suppose I have five SQL SELECT statements connected by UNION/UNION ALL, how many times should I  

specify UNION to eliminate the duplicate rows? - GS

  A76)           Once.

 

  Q77)    What is the restriction on using UNION in embedded SQL?

  A77)     It has to be in a CURSOR.

 

  Q78)    In the WHERE clause what is BETWEEN and IN? - GS

  A78)     BETWEEN supplies a range of values while IN supplies a list of values.

 

  Q79)    Is BETWEEN inclusive of the range values specified? - GS

  A79)     Yes.

 

  Q80)    What is 'LIKE' used for in WHERE clause? What are the wildcard characters? - GS

  A80)     LIKE is used for partial string matches.  '%' ( for a string of any character ) and '_' (for any single character ) are the  

two wild card characters.

 

  Q81)    When do you use a LIKE statement?

  A81)     To do partial search e.g. to search employee by name, you need not specify the complete name; using LIKE, you can  

search for partial string matches.

 

  Q82)    What is the meaning of underscore ( '_' ) in the LIKE statement? - GS

  A82)     Match for any single character.

 

  Q83)    What do you accomplish by GROUP BY ... HAVING clause? - GS

  A83)     GROUP BY partitions the selected rows on the distinct values of the column on which you group by. HAVING selects  

GROUPs which match the criteria specified

 

  Q84)    Consider the employee table with column PROJECT nullable.  How can you get a list of employees who are not  

assigned to any project?

  A84)           SELECT EMPNO FROM EMP WHERE PROJECT IS NULL;

 

  Q85)    What is the result of this query if no rows are selected:

SELECT SUM(SALARY) FROM EMP WHERE QUAL='MSC';

  A85)     NULL

 

  Q86)    Why SELECT * is not preferred in embedded SQL programs?

For  three reasons:

If the table structure is changed (a field is added), the program will have to be modified

Program might retrieve the columns which it might not use, leading on I/O over head.

The chance of an index only scan is lost.

 

  Q87)    What are correlated subqueries? - GS

A subquery in which the inner ( nested ) query refers back to the table in the outer query. Correlated

subqueries must be evaluated for each qualified row of the outer query that is referred to.

 

  Q88)    What is a cursor? Why should it be used? - GS

Cursor is a programming device that allows the SELECT to find a set of rows but return them one at a time.

Cursor should be used because the host language can deal with only one row at a time.

 

  Q89)    How would you retrieve rows from a DB2 table in embedded SQL? - GS

Either by using the single row SELECT statements,or by using the CURSOR.

 

  Q90)    Apart from cursor, what other ways are available to you to retrieve a row from a table in embedded SQL? - GS

Single row SELECTs.

 

  Q91)    How do you specify and use a cursor in a COBOL program? - GS

Use DECLARE CURSOR statement either in working storage or in procedure division (before open cursor),

to specify the SELECT statement.  Then use OPEN, FETCH rows in a loop and finally CLOSE.

 

  Q92)    What happens when you say OPEN CURSOR?

If there is an ORDER BY clause, rows are fetched, sorted and made available for the FETCH statement. Other wise simply the cursor is placed on the first row.

 

  Q93)    Is DECLARE CURSOR executable?

No.

 

  Q94)    Can you have more than one cursor open at any one time in a program ? - GS

Yes.

 

  Q95)    When you COMMIT, is the cursor closed?

Yes.

 

  1.        What is SQLCA and SQLDA?

  2.        What is 2 phase commit?

 

 


DATABASE 2(DB2)

 

The questions and answers that follow are intended for those with a working knowledge of DB2 as a “self-test”.

 

  Q1)  What is DB2 (IBM Database 2)?

  A1)  DB2 is a subsystem of the MVS operating system. It is a Database Management System (DBMS) for that operating system.

 

  Q2)  What is an access path?

  A2)  The path that is used to get to data specified in SQL statements.

 

  Q3)  What is an alias?

  A3)  It is an alternate name that can be used in SQL statements to refer to a table or view in the same or remote DB2 subsystem.

 

  Q4)  Explain what a plan is?

  A4)  Plan is a DB2 object (produced during the bind process) that associates one or more database request modules with a plan name.

 

  Q5)  What is a DB2 bind?

  A5)  Bind is a process  that builds “access paths” to DB2 tables. A bind uses the Database Request Modules(s) (DBRM(s)) from the DB2 pre-compile step as input and produces an application plan. It also checks the user’s authority and validates the SQL statements in the DBRM(s).  

 

  Q6)  What information is used as input to the bind process?

  A6)  The database request module produced during the pre-compile. The SYSIBM.SYSSTMT table of the DB2 catalog.

 

  Q7)  What is meant by the attachment facility?

  A7)  The attachment facility is an interface between DB2 and TSO, IMS/VS, CICS, or batch address spaces. It allows application programs to access DB2.

 

  Q8)  What is meant by AUTO COMMIT?

  A8)  AUTO COMMIT is a SPUFI option that commits the effects of SQL statements automatically if they are successfully executed.

 

  Q9)  What is a base table?

  A9)  A base table is a real table -  a table that physically exists in that there are physical stored records.

 

  Q10)    What is the function of buffer manager?

  A10)           The buffer manager is the DB2 component responsible for physically transferring data between an external medium and (virtual) storage (performs the actual I/O operations). It minimizes the amount of physical I/O actually performed with sophisticated buffering techniques(i.e., read-ahead buffering and look-aside buffering).

 

  Q11)    What is a buffer pool?

  A11)           A buffer pool is main storage that is reserved to satisfy the buffering requirements for one or more tablespaces or indexes, and is made up of either 4K or 32K pages.

 

  Q12)    How many buffer pools are there in DB2?

  A12)           There are four buffer pools: BP0, BP1, BP2, and BP32.

 

  Q13)    On the create tablespace, what does the CLOSE parameter do?

  A13)           CLOSE physically closes the tablespace when no one is working on the object. DB2 (release 2.3) will logically close tablespaces.

 

  Q14)    What is a clustering index?

  A14)           It is a type of index that (1) locates table rows and (2) determines how rows are grouped together in the tablespace.

 

  Q15)    What will the COMMIT accomplish?

  A15)           COMMIT will allow data changes to be permanent. This then permits the data to be accessed by other units of work. When a COMMIT occurs, locks are freed so other applications can reference the just committed data.

 

  Q16)    What is meant by concurrency?

  A16)           Concurrency is what allows more than one DB2 application process to access the same data at essentially the same time. Problems may occur, such as lost updates, access to uncommitted data, and un-repeatable reads.

 

  Q17)    What is cursor stability?

  A17)           It is cursor stability that “tells” DB2 that database values read by this application are protected only while they are being used.   (Changed values are protected until this application reaches the commit point). As soon as a program moves from one row to another, other programs may read or the first row.

 

  Q18)    What is the function of the Data Manager?

  A18)           The Data Manager is a DB2 component that manager the physical databases. It invokes other system components, as necessary, to perform detailed functions such as locking, logging, and physical I/O operations (such as search, retrieval, update, and index maintenance).

 

  Q19)    What is a Database Request Module(DBRM)?

  A19)           A DBRM is a DB2 component created by the DB2 pre-compiler containing the SQL source statements extracted from the application program. DBRMs are input to the bind process.

 

  Q20)    What is a data page?

  A20)           A data page is a unit of retrievable data, either 4K or 32K (depending on how the table is defined), containing user or catalog information.

 

  Q21)    What are data types?

  A21)           They are attributes of columns, literals, and host variables. The data types are SMALLINT, INTEGER, FLOAT, DECIMAL, CHAR, VARCHAR, DATE and TIME.

 

  Q22)    What is Declaration Generator(DCLGEN)?

  A22)           DCLGEN is a facility that is used to generate SQL statements that describe a table or view. These table or view descriptions are then used to check the validity of other SQL statements at precompile time. The table or view declares are used by the DB2I utility DCLGEN to build a host language structure, which is used by the DB2 precompiler to verify that correct column names and data types have been specified in the SQL statement.

 

  Q23)    What does DSNDB07 database do?

  A23)           DSNDB07 is where DB2 does its sorting. It includes DB2’s sort work area and external storage.

 

  Q24)    What will the FREE command do to  a  plan?

  A24)           It will drop(delete) that existing plan.

 

  Q25)    What is a host variable?

  A25)           This is a data item that is used in an SQL statement to receive a value or to supply a value. It must be preceded by a colon (:) to tell DB2 that the variable is not a column name.

 

  Q26)    What will the DB2 optimizer do?

  A26)           The optimizer is a DB2 component that processes SQL statements and selects the access paths.

 

  Q27)    What is a page?

  A27)           This is the unit of storage within a table space or indexspace that is accessed by DB2.

 

  Q28)    What is pagespace?

  A28)           Pagespace refers to either to an unpartitioned table, to an index space, or to a single partition of a partitioned table of index space.

 

  Q29)    What is a predicate?

  A29)           A predicate is an element of a search condition that expresses or implies a comparison operation.

 

  Q30)    What is a recovery log?

  A30)           A recovery log is a collection of records that describes the sequence of events that occur in DB2. The information is needed for recovery in the event of a failure during execution.

 

  Q31)    What is a Resource Control Table(RCT)? Describe its characteristics.

  A31)           The RCT is a table that is defined to a DB2/CICS region. It contains control characteristics which are assembled via the DSNCRCT macros. The RCT matches the CICS transaction ID to its associated DB2 authorization ID  and plan ID(CICS attachment facility).

 

  Q32)    Where are plans stored?

  A32)           Each plan is defined uniquely in the SYSIBM.SYSPLANS table to correspond to the transaction (s) that are to execute that plan.

 

  Q33)    What is meant by repeatable read?

  A33)           When an application program executes with repeatable read protection, rows referenced by the program can’t be changed by  other programs until the program reaches a commit point.

 

  Q34)    Describe what a storage group(STOGROUP) is?

  A34)           A STOGROUP is a named collection of DASD volumes to be used by tablespaces and index spaces of databases. The volumes of STOGROUP must be of the same device type.

 

  Q35)    How would you move a tablespace (using STOGROUP) to a different DASD volume allocated to that tablespace?

  A35)           If the tablespace used is only allocated to that STOGROUP:

                - ALTER STOGROUP - add volume (new) delete volume(old)

  -          REORG TABLESPACE or RECOVER TABLESPACE

            Create a new STOGROUP that points to the new volume. ALTER the tablespace and REORG or RECOVER the

            tablespace.

 

  Q36)    What is the format (internal layout) of “TIMESTAMP”?

  A36)           This is a seven part value that consists of a date (yymmdd) and time(hhmmss and microseconds).

 

  Q37)    What is meant by a unit of recovery?

  A37)           This is a sequence of operations within a unit of work(i.e., work done between commit points).

 

  Q38)    Can DASD types assigned to storage groups be intermixed(i.e., 3350s and 3380s)?

  A38)           No

 

  Q39)    What are the three types of page locks that can be held?

  A39)           Exclusive, update, and share.

 

  Q40)    Can DB2 be accessed by TSO users? If yes, which command is used to invoke DB2?

  A40)           DB2 can be invoked by TSO users by using the DSN RUN command.

 

  Q41)    How are write I/Os from the buffer pool executed?

  A41)           Asynchronously.

 

  Q42)    What is a DB2 catalog?

  A42)           The DB2 catalog is a set of tables that contain information about all of the DB2 objects(tables, views, plans etc.).

 

  Q43)    In which column of which DB2 catalog would you find the length of the rows for all tables?

  A43)           In the RECLENGTH column of SYSIBM.SYSTABLES

 

  Q44)    What information is held in SYSIBM.SYSCOPY?

  A44)           The SYSIBM.SYSCOPY table contains information about image copies made of the tablespaces.

 

  Q45)    What information is contained in a SYSCOPY entry?

  A45)           Included is the name of the database, the table space name, and the image copy type(full or incremental etc.,) as well as the date and time each copy was made.

 

  Q46)    What information can you find in SYSIBM.SYSLINKS table?

  A46)           The SYSIBM.SYSLINKS table contains information about the links between tables created by referential constraints.

 

  Q47)    Where would you find information about the type of database authority held by the user?

  A47)           SYSIBM.SYSDBAUTH.

 

  Q48)    Where could you look if you had a question about whether a column has been defined as an index?

  A48)           This information can be found in SYSIBM.SYSINDEXES.

 

  Q49)    Once you create a view, where would information about the view be stored?

  A49)           When a view is created, system information about the view is stored in SYSIBM.SYSVIEWS.

 

  Q50)    What is the SQL Communications Area and what are some of its key fields?  

  A50)           It is a data structure that must be included in any host-language program using SQL.  It is used to pass feedback about the sql operations to the program.  Fields are return codes, error messages, handling codes and warnings.  

 

  Q51)    What is DCLGEN?  

  A51)           DCLGEN stands for declarations generator;  it is a facility to generate DB2 sql data structures in COBOL or PL/I programs.  

 

  Q52)    How do you leave the cursor open after issuing a COMMIT?  (for DB2 2.3 or above only)

  A52)           Use WITH HOLD option in DECLARE CURSOR statement.  But, it has not effect in psuedo-conversational CICS programs.

 

  Q53)    Give the COBOL definition of a VARCHAR field.

  A53)           A VARCHAR column REMARKS would be defined as follows:

...

10 REMARKS.

              49 REMARKS-LEN    PIC S9(4) USAGE COMP.

              49 REMARKS-TEXT   PIC X(1920).

 

  Q54)    What is the physical storage length of each of the following DB2 data types: DATE, TIME, TIMESTAMP?

  A54)           DATE:                    4bytes

TIME:                         3bytes

TIMESTAMP:          10bytes

 

  Q55)    What is the COBOL picture clause of the following DB2 data types:  DATE, TIME, TIMESTAMP?

  A55)           DATE:                    PIC X(10)

TIME :                        PIC X(08)

TIMESTAMP:          PIC X(26)

 

  Q56)    What is the COBOL picture clause for a DB2 column defined as DECIMAL(11,2)? - GS

  A56)           PIC S9(9)V99 COMP-3.

Note: In DECIMAL(11,2), 11 indicates the size of the data type and 2 indicates the precision.

 

  Q57)    What is DCLGEN ? - GS

  A57)           DeCLarations GENerator: used to create the host language copy books for the table definitions. Also creates the DECLARE table.

 

  Q58)    What are the contents of a DCLGEN? - GS

  A58)           EXEC SQL DECLARE TABLE statement which gives the layout of the table/view in terms of DB2 datatypes.

A host language copy book that gives the host variable definitions for the column names.

 

  Q59)    Is it mandatory to use DCLGEN? If not, why would you use it at all? - GS

  A59)           It is not mandatory to use DCLGEN. Using DCLGEN, helps detect wrongly spelt column names etc. during the pre-compile stage itself (because of the DECLARE TABLE ).  DCLGEN being a tool, would generate accurate host variable definitions for the table reducing chances of error.

 

  Q60)    Is DECLARE TABLE in DCLGEN necessary? Why it used?

  A60)           It not necessary to have DECLARE TABLE statement in DCLGEN.  This is used by the pre-compiler to validate the table-name, view-name, column name etc., during pre-compile.

 

  Q61)    Will precompile of an DB2-COBOL program bomb, if DB2 is down?

  A61)           No.  Because the precompiler does not refer to the DB2 catalogue tables.

 

  Q62)    How is a typical DB2 batch program executed ?  

  A62)           There are two methods of executing a DB2-batch program  

1. Use DSN utility to run a DB2 batch program from native TSO.  An example is shown:

DSN SYSTEM(DSP3)

    RUN PROGRAM(EDD470BD) PLAN(EDD470BD) LIB('EDGS01T.OBJ.LOADLIB')

END

2. Use IKJEFT01 utility program to run the above DSN command in a JCL.

 

  Q63)    Assuming that a site's standard is that pgm name = plan name, what is the easiest way to find out which  

programs are affected by change in a table's structure?

  A63)           Query the catalogue tables SYSPLANDEP and SYSPACKDEP.

 

  Q64)    Name some fields from SQLCA.

  A64)           SQLCODE, SQLERRM, SQLERRD

 

  Q65)    How can you quickly find out the number of rows updated after an update statement?

  A65)           Check the value stored in SQLERRD(3).

 

  Q66)    What is EXPLAIN? - GS

  A66)           EXPLAIN is used to display the access path as determined by the optimizer for a SQL statement.  It can be used in SPUFI (for single SQL statement) or in BIND step (for embedded SQL).  The results of EXPLAIN is stored in U.PLAN_TABLE where U is the authorization id of the user

 

  Q67)    What do you need to do before you do EXPLAIN?

  A67)           Make sure that the PLAN_TABLE is created under the AUTHID.

 

  Q68)    Where is the output of EXPLAIN stored? - GS

  A68)           In USERID.PLAN_TABLE   

 

  Q69)    EXPLAIN has output with MATCHCOLS = 0.  What does it mean? - GS

  A69)           A nonmatching index scan if ACCESSTYPE = I.

 

  Q70)    How do you do the EXPLAIN of a dynamic SQL statement?

  A70)           There are two methods to achieve this:

1. Use SPUFI or QMF to EXPLAIN the dynamic SQL statement

2. Include EXPLAIN command in the embedded dynamic SQL statements

 

  Q71)    How do you simulate the EXPLAIN of an embedded SQL statement in SPUFI/QMF? Give an example with a host variable in WHERE clause)

  A71)           Use a question mark in place of a host variable (or an unknown value). For instance,  

SELECT EMP_NAME FROM EMP WHERE EMP_SALARY > ?

 

  Q72)    What are the isolation levels possible ? - GS

  A72)           CS:          Cursor Stability

RR:              Repeatable Read

 

  Q73)    What is the difference between CS and RR isolation levels?

  A73)           CS:          Releases the lock on a page after use

RR:             Retains all locks acquired till end of transaction

 

  Q74)    When do you specify the isolation level? How?

  A74)           During the BIND process(ISOLATION LEVEL is a parameter for the bind process).  ISOLATION ( CS/RR )...

 

  Q75)    I use CS and update a page.  Will the lock be released after I am done with that page?

  A75)           No.

 

  Q76)    What are the various locking levels available?

  A76)           PAGE, TABLE, TABLESPACE

 

  Q77)    How does DB2 determine what lock-size to use?

  A77)           There are three methods to determine the lock-size.  They are:

1. Based on the lock-size given while creating the tablespace

2. Programmer can direct the DB2 what lock-size to use

3. If lock-size ANY is specified, DB2 usually choses a lock-size of PAGE

 

  Q78)    What are the disadvantages of PAGE level lock?

  A78)           High resource utilization if large updates are to be done

 

  Q79)    What is lock escalation?

  A79)           Promoting a PAGE lock-size to table or tablespace lock-size when a transaction has aquired more locks than specified in NUMLKTS.  Locks should be taken on objects in single tablespace for escalation to occur.

 

  Q80)    What are the various locks available?

  A80)           SHARE, EXCLUSIVE, UPDATE

 

  Q81)    Can I use LOCK TABLE on a view?

  A81)           No.  To lock a view, take lock on the underlying tables.

 

  Q82)    What is ALTER ? - GS

  A82)           SQL command used to change the definition of DB2 objects.

 

  Q83)    What is a DBRM, PLAN ?

  A83)           DBRM: Data Base Request Module, has the SQL statements extracted from the host language program by  the pre-compiler. PLAN: A result of the BIND process.  It has the executable code for the SQL statements in the DBRM.

 

  Q84)    What is ACQUIRE/RELEASE in BIND?

  A84)           Determine the point at which DB2 acquires or releases locks against table and tablespaces, including intent locks.

 

  Q85)    What else is there in the PLAN apart from the access path? - GS

  A85)           PLAN has the executable code for the SQL statements in the host program

 

  Q86)    What happens to the PLAN if index used by it is dropped?

  A86)           Plan is marked as invalid. The next time the plan is accessed, it is rebound.

 

  Q87)    What are PACKAGES ? - GS

  A87)           They contain executable code for SQL statements for one DBRM.

 

  Q88)    What are the advantages of using a PACKAGE?

  A88)           The advantages of using PACKAGE are:

1. Avoid having to bind a large number of DBRM members into a plan

2. Avoid cost of a large bind

3. Avoid the entire transaction being unavailable during bind and automatic rebind of a plan

4. Minimize fallback complexities if changes result in an error.

 

  Q89)    What is a collection?

  A89)           A user defined name that is the anchor for packages.  It has not physical existence.  Main usage is to group packages.

 

  Q90)    In SPUFI suppose you want to select maximum of 1000 rows, but the select returns only 200 rows. What are the 2 SQLCODEs that are returned? - GS

  A90)           +100 (for successful completion of the query), 0 (for successful COMMIT  if AUTOCOMMIT is set to Yes).

 

  Q91)    How would you print the output of an SQL statement from SPUFI? - GS

  A91)           Print the output dataset.

 

  Q92)    Lot of updates have been done on a table due to which indexes have gone haywire.  What do you do?

  A92)           Looks like index page split has occurred.  DO a REORG of the indexes.

 

  Q93)    What is dynamic SQL? - GS

  A93)           Dynamic SQL is a SQL statement created at program execution time.

Q94)    When is the access path determined for dynamic SQL? - GS

  A94)           At run time, when the PREPARE statement is issued.

 

  Q95)    Suppose I have a program which uses a dynamic SQL and it has been performing well till now.  Off late, I find that the performance has deteriorated.  What happened? - GS

  A95)           There may be one of the following reasons:

Probably RUN STATS is not done and the program is using a wrong index due to incorrect stats.

Probably RUNSTATS is done and optimizer has chosen a wrong access path based on the latest statistics.

 

  Q96)    How does DB2 store NULL physically?

  A96)           As an extra-byte prefix to the column value. Physically, the null prefix is Hex '00' if the value is present and Hex 'FF' if it is not.

 

  Q97)    How do you retrieve the data from a nullable column? - GS

  A97)           Use null indicators. Syntax ... INTO :HOSTVAR:NULLIND

 

  Q98)    What is the picture clause of the null indicator variable? - GS

  A98)           S9(4) COMP.

 

  Q99)    What does it mean if the null indicator has -1, 0, -2? - GS

  A99)           -1             : the field is null;  0               : the field is not null; -2       : the field value is truncated

 

  Q100)       How do you insert a record with a nullable column?

  A100)        To insert a NULL, move -1 to the null indicator, To insert a valid value, move 0 to the null indicator

 

  Q101)       What is RUNSTATS? - GS

  A101)        A DB2 utility used to collect statistics about the data values in tables which can be used by the optimizer to decide the access path. It also collects statistics used for space management.  These statistics are stored in DB2 catalog tables.

 

  Q102)       When will you chose to run RUNSTATS?

  A102)        After a load, or after mass updates, inserts, deletes, or after REORG.

 

  Q103)       Give some example of statistics collected during RUNSTATS?

  A103)        Number of rows in the table, Percent of rows in clustering sequence, Number of distinct values of indexed column, Number  of rows moved to a nearby/fairway page due to row length increase

 

  Q104)       What is REORG?  When is it used?

  A104)        REORG reorganizes data on physical storage to reclutser rows, positioning overflowed rows in their proper sequence, to reclaim space, to restore free space.  It is used after heavy updates, inserts and delete activity and after segments of a segmented tablespace have become fragmented.

 

  Q105)       What is IMAGECOPY ? - GS

  A105)        It is full backup of a DB2 table which can be used in recovery.

 

  Q106)       When do you use the IMAGECOPY? - GS

  A106)        To take routine backup of tables, After a LOAD with LOG NO and After REORG with LOG NO

 

  Q107)       What is COPY PENDING status?

  A107)        A state in which, an image copy on a table needs to be taken,  In this status, the table is available only for queries.  You cannot update this table.  To remove the COPY PENDING status, you take an image copy or use REPAIR utility.

 

  Q108)       What is CHECK PENDING ?

  A108)        When a table is LOADed with ENFORCE NO option, then the table is left in CHECK PENDING status.  It means that the LOAD utility did not perform constraint checking.

 

  Q109)       What is QUIESCE?

  A109)        A QUIESCE flushes all DB2 buffers on to the disk. This gives a correct snapshot of the database and should be used before and after any IMAGECOPY to maintain consistency.

 

  Q110)       What is a clustering index ? - GS

  A110)        Causes the data rows to be stored in the order specified in the index. A mandatory index defined on a partitioned table space.

 

  Q111)       How many clustering indexes can be defined for a table?

  A111)        Only one.

 

  Q112)       What is the difference between primary key & unique index ?

  A112)        Primary Key: a relational database constraint.  Primary key consists of one or more columns that uniquely identify a row in the table.  For a normalized relation, there is one designated primary key.  

Unique index: a physical object that stores only unique values.  There can be one or more unique indexes on a table.

 

  Q113)       What is sqlcode -922 ?

  A113)        Authorization failure

 

  Q114)       What is sqlcode -811?

  A114)        SELECT statement has resulted in retrieval of more than one row.

 

  Q115)       What does the sqlcode of -818 pertain to? - GS

  A115)        This is generated when the consistency tokens in the DBRM and the load module are different.

 

  Q116)       Are views updatable ?

  A116)        Not all of them.  Some views are updatable e.g. single table view with all the fields or mandatory fields. Examples of non-updatable views are views which are joins, views that contain aggregate functions (such as MIN), and views that have GROUP BY clause.

 

  Q117)       If I have a view which is a join of two or more tables, can this view be updatable? - GS

  A117)        No.

 

  Q118)       What are the 4 environments which can access DB2 ?

  A118)        TSO, CICS, IMS and BATCH

 

  Q119)       What is an inner join, and an outer join ?

  A119)        Inner Join: combine information from two or more tables by comparing all values that meet the search criteria in the designated column or columns of one table with all the values in corresponding columns of the other table or tables.  This kind of join which involve a match in both columns are called inner joins.

 

Outer join : Is one in which you want both matching and non matching rows to be returned. DB2 has no specific operator for outer joins, it can be simulated by combining a join and a correlated sub query with a UNION.

 

  Q120)       What is FREEPAGE and PCTFREE in TABLESPACE creation?

  A120)        PCTFREE: percentage of each page to be left free  

FREEPAGE: Number of pages to be loaded with data between each free page

 

  Q121)       What are simple, segmented and  partitioned table spaces ?

  A121)        Simple Tablespace:              Can contain one or more tables.  Rows from multiple tables can be interleaved on a page  

                                                                under the DBA’s control and maintenance

Segmented Tablespace:      Can contain one or more tables. Tablespace is divided into segments of 4 to 64 pages in

increments of 4 pages.  Each segment is dedicated to single table.  A table can occupy

multiple segments

Partitioned Tablespace:      Can contain one table. Tablespace is divided into parts and each part is put in a separate

VSAM dataset.

 

  Q122)       What is filter factor?

  A122)        One divided by the number of distinct values of a column.

 

  Q123)       What is index cardinality? - GS

  A123)        The number of distinct values a column or columns contain.

 

  Q124)       What is a synonym ?

  A124)        Synonym is an alternate name for a table or view used mainly to hide the leading qualifier of a table or view.. A synonym is accessible only by the creator.

 

  Q125)       What is the difference between SYNONYM and ALIAS?

  A125)        SYNONYM           : is dropped when the table or tablespace is dropped. Synonym is available only to the creator.  

ALIAS                   : is retained even if table or tablespace is dropped.  ALIAS can be created even if the table does

   not exist. It is used mainly in distributed environment to hide the location information from

   programs. Alias is a global object & is available to all.

 

  Q126)       What do you mean by NOT NULL WITH DEFAULT?  When will you use it?

  A126)        This column cannot have nulls and while insertion, if no value is supplied then it will have zeroes, spaces or date/time depending on whether it is numeric, character or date/time.Use it when you do not want to have nulls but at the same time cannot give values all the time you insert this row.

 

  Q127)       What do you mean by NOT NULL?  When will you use it?

  A127)        The column cannot have nulls.  Use it for key fields.

 

  Q128)       When would you prefer to use VARCHAR?

  A128)        When a column which contains long text, e.g. remarks, notes, may have in most cases less than 50% of the maximum length.

 

  Q129)       What are the disadvantages of using VARCHAR?

  A129)        Can lead to high space utilization if most of the values are close to maximum.

Positioning of VARCHAR column has to be done carefully as it has performance implications.

Relocation of rows to different pages can lead to more I/Os on retrieval.

 

  Q130)       How do I create a table MANAGER (EMP-NO, MANAGER) where MANAGER is a foreign key which references to EMP-NO in the same table?  Give the exact DDL.

  A130)        First CREATE MANAGER table with EMP-NO as the primary key.  Then ALTER it to define the foreign key.