Quickie Blog – DBCC PAGE… WITH TABLERESULTS?!?

Welcome back to another quick blog about something I just learned.

I'm playing around with Always Encrypted for the first time. I was just following along with the basic tutorial and encrypted some columns in my AutoDealershipDemo database. But then I decided to go crack open a data page using my friend DBCC PAGE.

Text output of DBCC PAGE (15, 1, 1218323, 3)

WITH TABLERESULTS

Because I only ever use DBCC PAGE() once every so often, I did a quick search to refresh myself on the syntax. And what I found was someone using something I'd never seen before…

DBCC PAGE(aa, bb, cc, dd) WITH TABLERESULTS

“WITH TABLERESULTS”? What is this…?

DBCC PAGE(15, 1, 1218323, 3) WITH TABLERESULTS

WOW – for me, this makes the output SO much easier to consume!!!
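If you want to find a page to crack open yourself, here's a sketch. Keep in mind that DBCC PAGE and sys.dm_db_database_page_allocations are both undocumented, and the database and table names here are just placeholders standing in for my demo objects:

```sql
-- Find a data page belonging to a table (undocumented DMF; names are placeholders)
SELECT allocated_page_file_id, allocated_page_page_id
FROM sys.dm_db_database_page_allocations(
    DB_ID('AutoDealershipDemo'), OBJECT_ID('dbo.InventoryFlat'), NULL, NULL, 'DETAILED')
WHERE page_type_desc = 'DATA_PAGE';

-- Then crack open one of the returned pages:
-- DBCC PAGE (database_id, file_id, page_id, print_option)
DBCC PAGE (15, 1, 1218323, 3) WITH TABLERESULTS;
```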

And what of that encrypted data?

And if you’re curious, here’s what I was really after.

You can see how I had encrypted the VIN, InvoicePrice, MSRP, and DateReceived columns in my test table. Pretty cool to see the data scrambled up, and interesting to see how it consumed a heck of a lot more space in-row.

DBCC Commands

Another quick search tells me that you can also use “WITH TABLERESULTS” with other DBCC commands too. I didn’t look much more into it since I got what I wanted, but thought I’d share this neat tidbit.
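For example, here's one sketch with DBCC SHOWCONTIG, which includes WITH TABLERESULTS right in its documented syntax (the table name is a placeholder):

```sql
-- WITH TABLERESULTS works with other DBCC commands too, e.g. DBCC SHOWCONTIG
DBCC SHOWCONTIG ('dbo.InventoryFlat') WITH TABLERESULTS;
```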

Thanks for reading!

Quickie Blog – Page Compression & Heaps

Found an interesting tidbit while working on a compression diagnostic script, and thought I’d write a quick blog.

I was refreshing myself on all of the output of sys.dm_db_index_physical_stats and its nuances, when I found an interesting tidbit about “compressed_page_count.”


For heaps, newly allocated pages aren’t PAGE compressed.

A heap is PAGE compressed under two special conditions: when data is bulk imported or when a heap is rebuilt. Typical DML operations that cause page allocations aren’t PAGE compressed.

Rebuild a heap when the compressed_page_count value grows larger than the threshold you want.

https://learn.microsoft.com/en-us/sql/relational-databases/system-dynamic-management-views/sys-dm-db-index-physical-stats-transact-sql?view=sql-server-ver16
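To check your own heaps, a sketch of a query against that DMF might look like the following. Run it in the context of your database; note that the 'DETAILED' mode scans every page, so be careful on large tables, and the table name in the rebuild is a placeholder:

```sql
-- compressed_page_count vs total page_count for heaps (index_id = 0)
SELECT OBJECT_NAME(ips.object_id) AS table_name,
       ips.page_count,
       ips.compressed_page_count
FROM sys.dm_db_index_physical_stats(DB_ID(), NULL, NULL, NULL, 'DETAILED') AS ips
WHERE ips.index_id = 0;  -- heaps only

-- Rebuilding the heap will PAGE compress its existing pages
-- (assumes the heap already has DATA_COMPRESSION = PAGE; table name is a placeholder)
ALTER TABLE dbo.MyHeapTable REBUILD;
```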

Well then… the moral of the story, folks, is that if you have heaps that are PAGE compressed, you may not be getting the benefits you think you're getting if you're not executing rebuilds regularly!

Thanks for reading.

sp_helpExpandView and Triggers

Recently someone asked my wife Deborah if she knew if my sp_helpExpandView utility procedure could also work with triggers. Great question – I rarely work with triggers, especially nowadays, so I had no idea… and finally found some time and motivation to dig into it deeper.

TL;DR – How Does sp_helpExpandView Work with Triggers?

  1. If you run sp_helpExpandView with a parameter that IS a trigger, you'll get information about the objects that the trigger references. However, you will not get information about the table that the trigger is defined on (though that could be derived from another query).
  2. If you run sp_helpExpandView with a parameter that is a table that HAS a trigger, you will NOT get any information about that trigger being present. You'll have to write a different query for that.
  3. If you run sp_helpExpandView with a parameter that is a table that is UTILIZED BY a trigger, you WILL get information about that trigger.
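For that first caveat, here's a sketch of the query that derives the table a trigger is defined on (using the trigger name from my demo):

```sql
-- Find the table that a DML trigger is defined on
SELECT t.name AS trigger_name,
       OBJECT_SCHEMA_NAME(t.parent_id) + '.' + OBJECT_NAME(t.parent_id) AS parent_table
FROM sys.triggers AS t
WHERE t.name = 'tr_InsertTriggerForAuditing';
```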

Digging Deeper

To dig deeper, I set up a quick prototype with two tables and a basic INSERT trigger.

USE TempDB;

GO
CREATE TABLE dbo.TableWithADMLTrigger (
    RecID INT IDENTITY(1, 1) PRIMARY KEY CLUSTERED,
    MyValue VARCHAR(50),
    MyTimestamp DATETIME DEFAULT(GETDATE())
);

CREATE TABLE dbo.TriggerAuditTable (
    RecID INT IDENTITY(1, 1) PRIMARY KEY CLUSTERED,
    InsertedRecID INT,
    MyValue VARCHAR(50),
    MyTimestamp DATETIME
);
GO

CREATE OR ALTER TRIGGER dbo.tr_InsertTriggerForAuditing
ON dbo.TableWithADMLTrigger
AFTER INSERT
AS
BEGIN
    INSERT INTO dbo.TriggerAuditTable (
        InsertedRecID, MyValue, MyTimestamp
    )
    SELECT inserted.RecID, inserted.MyValue, inserted.MyTimestamp
    FROM inserted;
END
GO

INSERT INTO dbo.TableWithADMLTrigger (MyValue) VALUES ('alpha');
GO 3

SELECT *
FROM dbo.TableWithADMLTrigger;

SELECT *
FROM dbo.TriggerAuditTable;
GO

Now, let’s try running sp_helpExpandView against each entity.

EXEC sp_helpExpandView 'dbo.TableWithADMLTrigger';
GO

EXEC sp_helpExpandView 'dbo.TriggerAuditTable';
GO

EXEC sp_helpExpandView 'dbo.tr_InsertTriggerForAuditing';
GO

The first two commands will yield nothing. Here’s the output of the third, against the trigger itself. Note that it shows you the dbo.TriggerAuditTable that is used by the trigger, but not the table that the trigger itself is defined on – dbo.TableWithADMLTrigger.

The reason for this has to do with the underlying code’s use of sys.dm_sql_referenced_entities(). There are actually two of these DMFs in SQL Server, sys.dm_sql_referenced_entities() and sys.dm_sql_referencing_entities(). Note the critical difference, referencED and referencING.

Confused? Me too – which is why I have to re-explain the difference to myself every time I revisit this topic. Hopefully this summary helps you.

  • Referenced Entities: Returns all objects that are referenced by the object passed in as the parameter.

    ex: If dbo.MyStoredProcedure contains a SELECT * FROM dbo.MyTable, then dbo.MyTable is referenced by the stored procedure and would appear in the resultset of a query against sys.dm_sql_referenced_entities()
  • Referencing Entities: Returns all objects that reference the object passed in as the parameter.

    ex: If dbo.MyTable is passed in, then dbo.MyStoredProcedure would appear in the resultset of a query against sys.dm_sql_referencing_entities()

Interestingly enough, not all object types are supported by both DMFs. See the Remarks section of either DMF's documentation for the full table. But the consequence for sp_helpExpandView is that using it against tables that are either referenced by or referencing a trigger will not yield any output.
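You can see the referenced side for yourself by querying the DMF directly against the demo trigger:

```sql
-- Objects referenced BY the trigger: you'll see dbo.TriggerAuditTable,
-- but not the parent table dbo.TableWithADMLTrigger
SELECT referenced_schema_name, referenced_entity_name, referenced_minor_name
FROM sys.dm_sql_referenced_entities('dbo.tr_InsertTriggerForAuditing', 'OBJECT');
```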

Partial Solution via sys.dm_sql_referencing_entities()

You can use sys.dm_sql_referencing_entities() to see triggers that reference a table. Here’s more example code.

DECLARE @TableName NVARCHAR(256) = 'dbo.TriggerAuditTable';

SELECT -- DISTINCT
    DB_NAME() AS database_name,
    @TableName AS table_name,
    COALESCE(schemas.name, dm_sql_referencing_entities.referencing_schema_name) + '.' + dm_sql_referencing_entities.referencing_entity_name AS object_referencing_this_table,
    schemas.type AS referencing_object_type,
    dm_sql_referencing_entities.referencing_id AS referencing_object_id
FROM sys.dm_sql_referencing_entities(@TableName, 'OBJECT')
LEFT OUTER JOIN (
    SELECT objects.object_id, schemas.name, objects.type
    FROM sys.objects
    INNER JOIN sys.schemas
        ON objects.schema_id = schemas.schema_id
) AS schemas
    ON dm_sql_referencing_entities.referencing_id = schemas.object_id;

You can combine the above with code that returns all triggers defined on a given table, to give you a bigger picture of trigger impact and relationships. Someday I may try to integrate this into sp_helpExpandView. Or I would absolutely welcome and credit anyone who wants to modify the code and issue a Pull Request on GitHub.
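If it helps, here's a sketch of that companion query, using the demo table from above:

```sql
-- All DML triggers defined ON a given table
SELECT t.name AS trigger_name,
       t.is_disabled,
       t.is_instead_of_trigger
FROM sys.triggers AS t
WHERE t.parent_id = OBJECT_ID('dbo.TableWithADMLTrigger');
```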

Thanks for reading.

Alternative to RAND() for Random Number Generation

This is a quick blog to “document” a T-SQL technique for generating random numbers. I've been using this for years, but don't use it frequently enough to have it fully memorized. So whenever I do need it, I have to go look it up.

TL;DR

CHECKSUM(NEWID())

SELECT a bunch of Random Numbers

Let’s say you need to generate random numbers as part of a resultset. Here’s a simple example.

/* Use RAND() */
SELECT TOP 10
    name,
    RAND() AS random_value
FROM sys.databases;

Unfortunately, if you use RAND(), you'll get the same value for every record in the resultset. But what if you need DIFFERENT random values for each record?

There are a few different approaches you can take, but here's my favorite, which I think is the cleanest:

/* Use CHECKSUM(NEWID()) */
SELECT TOP 10
    name,
    CHECKSUM(NEWID()) AS random_value
FROM sys.databases;
GO 3 -- to loop the prior batch 3 times

Now we get a different random value per record AND different random values per execution!

At this point, you can use whatever method you like to limit the output to the range of values you want. I regularly use the modulus operator (%) for this.

/* Generate a random value between 1 and 10 */
SELECT TOP 10
    name,
    ABS((CHECKSUM(NEWID()) % 10)) + 1 AS random_value
FROM sys.databases;
GO 2

Notice that I added one additional function to the column definition – ABS(). That’s because as you can see in the prior examples, the raw output of CHECKSUM(NEWID()) will return both positive and negative integer values. So if I wanted only positive values between 1 and 10, using ABS() to get the absolute value is a clean solution.
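If you need a range other than 1 to 10, the same pattern generalizes. Here's a sketch with assumed variable names:

```sql
-- Random integer between @Min and @Max, inclusive
-- (ABS is applied after the modulus, so the small result can't overflow)
DECLARE @Min INT = 100, @Max INT = 999;

SELECT TOP 10
    name,
    ABS(CHECKSUM(NEWID()) % (@Max - @Min + 1)) + @Min AS random_value
FROM sys.databases;
```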

Hope someone else finds this helpful! Thanks for reading.

Backup Internals – Part 5: Balancing Performance

Welcome back to Part 5 of my Backup Internals Series. In this blog, I want to explore some thoughts around maximizing the performance of your SQL Server backup operations.

Blog Series Table of Contents

Fast, Good, or Cheap…

Many of us have heard the adage that there are three options to any endeavor: Fast, Good, and Cheap, but you can only ever pick two. When it comes to accelerating your (FULL) backup performance, there's a similar set of choices you can make, most of which have a trade-off of some sort.

Gather Requirements

If I were to have an opportunity to re-evaluate an environment’s backup strategy and schedule, there’s a number of questions I’d want to try to answer first. Remember, the scope of this thought exercise is thinking about FULL Backup strategy from a purely performance angle.

Questions to Ask

  • Backup maintenance windows: How long and how frequently?
    Every night for 4 hours? Anytime over the weekend? For 3 minutes on Sunday, between 22:55 and 22:59?
  • How many databases to back up and how large are they?
    Does the server just have 1 database, but it's 15TB in size? Or do you have 800 databases, one per customer? And of those 800, what's the individual size distribution? Maybe 700 databases are smaller than 100GB, another 75 databases are between 100GB and 500GB, and the last 25 databases are over 500GB?
  • Exclusivity during backup maintenance window?
    Do you have the luxury of no workload running during your maintenance window? Or do you also have to juggle index maintenance, DBCC CHECKDB, or other application processes like ETLs, nightly processing jobs, etc.? Or to put it another way, do you have free rein to max out your SQL Server resources during your window?

Assumptions

  • 1 SQL Server with many databases
  • CPU = 12 cores; RAM = 256GB
  • Backup storage capacity and storage ingest throughput are not a concern (you bought “orange”)
  • Will ignore RPO/RTO business priorities (which I might tackle in a future blog)

Start Doing Math

Now that you know how much time you have and how many/how much you need to back up, you need to start weighing your options. Do you have the luxury to run one single backup job that will back up your databases one after another? Or do you need to split your databases across multiple backup jobs that run in parallel?

If server resources were infinite, one could theoretically kick off an individual backup job for each and every database, all starting at the exact same time. But of course, that makes no sense.

Revisiting What We’ve Learned

We now know that if our database’s data file(s) all reside on a single volume, we’ll only ever get one Reader Thread. So with a 12 core server, if I have full usage of the server’s resources, I may start with 8 backup output files to get 8 Writer Threads. And I might choose to use a large BUFFERCOUNT value and larger MAXTRANSFERSIZE value, which will result in more RAM consumption.

Now let's pretend that, to meet our requirements, we need to run backups in parallel. You might estimate that you need to run 4 backup jobs simultaneously. If you use the above parameters, you may now start overrunning your CPU! Remember, it's not always about 100% CPU utilization either… all cores could be utilized at, say, 30%, but you could be context switching like crazy.
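As a rough sketch of the memory math (all the numbers here are assumptions for illustration): the trace flag output later in this post reports Total buffer space as roughly BufferCount × MaxTransferSize × Sets Of Buffers, so you can estimate the footprint of parallel jobs before you ever run a test.

```sql
-- Rough backup buffer memory estimate (all values are assumptions)
DECLARE @BufferCount INT = 500,
        @MaxTransferSizeMB INT = 2,  -- MAXTRANSFERSIZE = 2097152 bytes
        @SetsOfBuffers INT = 3,      -- as reported by the trace flags for a compressed backup
        @ParallelJobs INT = 4;

SELECT @BufferCount * @MaxTransferSizeMB * @SetsOfBuffers AS buffer_mb_per_job,         -- 3000 MB
       @BufferCount * @MaxTransferSizeMB * @SetsOfBuffers * @ParallelJobs AS total_mb;  -- 12000 MB
```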

Who Doesn’t Love a Trace Flag?

So of course the above means you need to test different permutations. But when you’re doing testing, how can you determine the actual resource utilization of a given backup job? This is where some infrequently highlighted trace flags come into play.

Trace Flag 3213 - Generate diagnostic data about BACKUP & RESTORE operation (least)
Trace Flag 3004 - Generate diagnostic data about BACKUP & RESTORE operation (more)
Trace Flag 3014 - Generate diagnostic data about BACKUP & RESTORE operation (all)
These are officially undocumented

Trace Flag 3604 - Redirects output information to SSMS Results Pane
Trace Flag 3605 - Redirects output information to SQL Server Error Log: use if you need timestamps

You'll see the above three Trace Flags are suffixed with least, more, and all. That's because the output that each Trace Flag yields seems to have some overlap. What's more challenging is that I've found inconsistent documentation in older blogs that cover these Trace Flags as well. So I just opt to use them all, to cover all of my bases.

-- Turn on Trace Flags
DBCC TRACEON(3604, 3004, 3014, 3213, -1);
GO

When turned on, you will receive a ton of diagnostic information after running a backup operation. Here’s an example (with prefix removed for brevity):

BACKUP DATABASE Sandbox_MultiFile_SingleVol TO
DISK='NUL'
WITH COPY_ONLY, FORMAT, INIT, STATS = 15
GO

--------------
[prefix]: BACKUP DATABASE started
[prefix]: Opening the database with S lock
[prefix]: Acquiring bulk-op lock on the database
[prefix]: Synchronizing with other operations on the database is complete
[prefix]: Opening the backup media set
[prefix]: The backup media set is open

Backup/Restore buffer configuration parameters

Memory limit: 32765 MB
BufferCount: 7
Sets Of Buffers: 1
MaxTransferSize: 1024 KB
Min MaxTransferSize: 64 KB
Total buffer space: 7 MB
Tabular data device count: 1
Fulltext data device count: 0
Filestream device count: 0
TXF device count: 0
Filesystem i/o alignment: 512
Media Buffer count: 7
Media Buffer size: 1024 KB

[prefix]: Preparing the media set for writing
[prefix]: The media set is ready for backup

[prefix]: Effective options: Checksum=0, Compression=0, Encryption=0, BufferCount=7, MaxTransferSize=1024 KB
[prefix]: Checkpoint LSN: 0:0:0
[prefix]: Checkpoint is complete (elapsed = 17 ms)
[prefix]: Start LSN: 3246:76679:230, SERepl LSN: 0:0:0
[prefix]: Last LSN: 3246:76775:1
[prefix]: Scanning allocation bitmaps
[prefix]: Data section: 204684263424 bytes in total
[prefix]: Scanning allocation bitmaps is complete

[prefix]: Calculating expected size of total data
[prefix]: FID=1, ExpectedExtents=780610, IsDifferentialMapAccurate=1
[prefix]: FID=3, ExpectedExtents=781546, IsDifferentialMapAccurate=1
[prefix]: FID=4, ExpectedExtents=780523, IsDifferentialMapAccurate=1
[prefix]: FID=5, ExpectedExtents=780555, IsDifferentialMapAccurate=1
[prefix]: TotalSize=204684263424 bytes
[prefix]: Estimated total size = 204684312576 bytes (data size = 204684263424 bytes, log size = 49152 bytes)

[prefix]: Work estimation is complete

[prefix]: Do the first force checkpoint before copying data section
[prefix]: Checkpoint LSN: 3246:76679:230
[prefix]: Checkpoint is complete (elapsed = 8 ms)
[prefix]: Writing the leading metadata

Shared Backup BufferQ count: 7

[prefix]: Calculating expected size of total data
[prefix]: FID=1, ExpectedExtents=780610, IsDifferentialMapAccurate=1
[prefix]: FID=3, ExpectedExtents=781546, IsDifferentialMapAccurate=1
[prefix]: FID=4, ExpectedExtents=780523, IsDifferentialMapAccurate=1
[prefix]: FID=5, ExpectedExtents=780555, IsDifferentialMapAccurate=1
[prefix]: TotalSize=204684263424 bytes

[prefix]: Copying data files
[prefix]: Data file backup process set to run in serial mode within a volume.
[prefix]: Number of data file readers = 1

15 percent processed.
30 percent processed.
45 percent processed.
60 percent processed.
75 percent processed.
90 percent processed.

[prefix]: InitialExpectedSize=204684263424 bytes, FinalSize=204684263424 bytes, ExcessMode=0

[prefix]: Calculating expected size of total data
[prefix]: FID=1, ExpectedExtents=780610, IsDifferentialMapAccurate=1
[prefix]: FID=3, ExpectedExtents=781546, IsDifferentialMapAccurate=1
[prefix]: FID=4, ExpectedExtents=780523, IsDifferentialMapAccurate=1
[prefix]: FID=5, ExpectedExtents=780555, IsDifferentialMapAccurate=1
[prefix]: TotalSize=204684263424 bytes

[prefix]: Do the second force checkpoint before copying diff section
[prefix]: Checkpoint LSN: 3246:76679:230
[prefix]: Checkpoint is complete (elapsed = 9 ms)
[prefix]: Start pin the log.
[prefix]: Start LSN: 3246:76778:1, SERepl LSN: 0:0:0
[prefix]: Offline the sparse bitmap
[prefix]: Scanning allocation bitmaps
[prefix]: Diff section: 3473408 bytes in total
[prefix]: Scanning allocation bitmaps is complete

[prefix]: Calculating expected size of total data
[prefix]: FID=1, ExpectedExtents=14, IsDifferentialMapAccurate=1
[prefix]: FID=3, ExpectedExtents=13, IsDifferentialMapAccurate=1
[prefix]: FID=4, ExpectedExtents=13, IsDifferentialMapAccurate=1
[prefix]: FID=5, ExpectedExtents=13, IsDifferentialMapAccurate=1
[prefix]: TotalSize=3473408 bytes

[prefix]: Copying data files
[prefix]: Data file backup process set to run in serial mode within a volume.
[prefix]: Number of data file readers = 1
[prefix]: InitialExpectedSize=3473408 bytes, FinalSize=3473408 bytes, ExcessMode=0

[prefix]: Calculating expected size of total data
[prefix]: FID=1, ExpectedExtents=14, IsDifferentialMapAccurate=1
[prefix]: FID=3, ExpectedExtents=13, IsDifferentialMapAccurate=1
[prefix]: FID=4, ExpectedExtents=13, IsDifferentialMapAccurate=1
[prefix]: FID=5, ExpectedExtents=13, IsDifferentialMapAccurate=1
[prefix]: TotalSize=3473408 bytes
[prefix]: Diff section copy finished
[prefix]: Last LSN: 3246:76781:1
[prefix]: Copying data files is complete

Processed 6244992 pages for database 'Sandbox_MultiFile_SingleVol', file 'Sandbox_MultiFile' on file 1.
Processed 6252472 pages for database 'Sandbox_MultiFile_SingleVol', file 'Sandbox_MultiFile2' on file 1.
Processed 6244288 pages for database 'Sandbox_MultiFile_SingleVol', file 'Sandbox_MultiFile3' on file 1.
Processed 6244544 pages for database 'Sandbox_MultiFile_SingleVol', file 'Sandbox_MultiFile4' on file 1.

[prefix]: Copying transaction log
[prefix]: MediaFamily(0): FID=2, VLFID=3246, DataStreamSize=65536 bytes
Processed 1 pages for database 'Sandbox_MultiFile_SingleVol', file 'Sandbox_MultiFile_log' on file 1.
[prefix]: Copying transaction log is complete

[prefix]: Writing the trailing metadata
[prefix]: Writing the end of backup set
[prefix]: Writing history records for NoteBackup
[prefix]: Writing history records for NoteBackup is complete (elapsed = 37 ms)

BACKUP DATABASE successfully processed 24986297 pages in 87.111 seconds (2240.881 MB/sec).
[prefix]: BACKUP DATABASE finished

I’ve grouped together interesting “sub-operations” like when the tail of the log backup is taken. Note the multiple size estimates that are taken throughout as well. Note that there are 4 data files, which is why you see 4 FID entries each time. But also note that all 4 data files are on 1 single volume!

Key TF Output Highlights

Backup/Restore buffer configuration parameters

Memory limit: 32765 MB
BufferCount: 7
Sets Of Buffers: 1
MaxTransferSize: 1024 KB
Min MaxTransferSize: 64 KB
Total buffer space: 7 MB

[prefix]: Effective options: Checksum=0, Compression=0, Encryption=0, BufferCount=7, MaxTransferSize=1024 KB

[prefix]: Number of data file readers = 1

Let’s compare and contrast this to a backup operation with multiple backup output targets (this different database’s data files are spread across 4 different data volumes).

BACKUP DATABASE [Sandbox_MultiFile_MultiVol] TO
DISK = '\\10.21.200.27\ayun-sql-backups\Sandbox_MultiFile_MultiVol_1.bak',
DISK = '\\10.21.200.28\ayun-sql-backups\Sandbox_MultiFile_MultiVol_2.bak',
DISK = '\\10.21.200.70\ayun-sql-backups\Sandbox_MultiFile_MultiVol_3.bak',
DISK = '\\10.21.200.71\ayun-sql-backups\Sandbox_MultiFile_MultiVol_4.bak',
DISK = '\\10.21.200.72\ayun-sql-backups\Sandbox_MultiFile_MultiVol_5.bak',
DISK = '\\10.21.200.73\ayun-sql-backups\Sandbox_MultiFile_MultiVol_6.bak',
DISK = '\\10.21.200.74\ayun-sql-backups\Sandbox_MultiFile_MultiVol_7.bak',
DISK = '\\10.21.200.75\ayun-sql-backups\Sandbox_MultiFile_MultiVol_8.bak'
WITH FORMAT, INIT, STATS = 10,
MAXTRANSFERSIZE = 2097152,
BUFFERCOUNT = 500, COMPRESSION;
GO

--------------

Backup/Restore buffer configuration parameters
Memory limit: 98295 MB
BufferCount: 500
Sets Of Buffers: 3
MaxTransferSize: 2048 KB
Min MaxTransferSize: 64 KB
Total buffer space: 3000 MB
Filesystem i/o alignment: 512
Media Buffer count: 500
Media Buffer size: 2048 KB
Encode Buffer count: 500

Backup(Sandbox_MultiFile_MultiVol): Number of data file readers = 4

I think there's a lot of really cool data in here. Remember that you can use Trace Flag 3605 to log this information to the Error Log. That way, you could record what your BACKUP jobs are doing and review the output later at your leisure (just remember to turn the TF off, lest you bloat your logs).

If you want to see more examples of these Trace Flags in action, visit my GitHub and check out my demo scripts from the original presentation. I introduce the Trace Flags in the 1_Baselining.sql script, then leverage them throughout the other demo scripts.

Workers vs CPU Cores

One more thing to keep in mind is the total number of worker threads available on your SQL Server. That's different than the number of CPU cores you happen to have. I'm not going to dive deeper into this today, but will leave this old but still valid blog from Bob Dorr for reference instead: “How It Works: How many databases can be backed up simultaneously?”
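You can check that worker thread ceiling with a quick query:

```sql
-- Maximum number of worker threads the instance can spawn
SELECT max_workers_count
FROM sys.dm_os_sys_info;
```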

Bringing It All Together

Hopefully now, with the above information, you can see that you really need to do an analysis and essentially create a matrix, based on your requirements and available resources. This will help you determine how much you want to crank up your database tuneables to best accelerate your FULL backups for your environment.

Hope You’ve Enjoyed This Series

This’ll most likely be the final blog of this series, though I might do an “addendum” with random tidbits. Let me know in the comments if there’s anything else you’d like me to explore in an epilogue.

Thanks for reading!