Deduplication > Deduplication Size Savings

Article:HOWTO71900  |  Created: 2012-02-08  |  Updated: 2012-02-08  |  Article URL http://www.symantec.com/docs/HOWTO71900
Article Type
How To


Environment

Subject


Deduplication > Deduplication Size Savings

Most NetBackup customers have either implemented some sort of deduplication or plan to in the near future. This is the current trend as customers try to get control of infrastructure and the amount of time backups take. Add to this the need for many customers to protect remote offices across smaller WAN links and deduplication becomes critical since only so much data can be pushed through a small pipe and many times there is no one at the remote site who is capable of managing hardware. In the data center, deduplication reporting helps to determine what the actual deduplication rate is occurring at each PureDisk storage unit. Reporting can be abstracted to view the protected size or stored size across all PureDisk environments or drilled down in to one specific area. Common file or data types can then be identified and matched with their ability to be deduplicated. The report below is a simple dedupe report created using a Media Server Deduplication Pool. It is a very basic report and shows how much data would have been backed up without deduplication enabled. In large environments this would show the number of GB or TB that is NOT being used due to deduplication. The other reports can contrast the dedupe data savings vs. the actual deduped information however in this very small test environment this report shows better information. OpsCenter and OpsCenter Analytics provide the ability to report on the various forms of deduplication available with NetBackup. It can provide reports that show things such as how much deduplication is being done at a remote site before the data hits the WAN pipe. Using the trending feature of Opscenter Analytics can also help to understand if the data is growing faster than the hardware or when the WAN bandwidth can be expected to max out. This allows the company to make educated decisions on when to add infrastructure proactively instead of waiting for things to fail. This report shows the total number of bytes that do not need to be backed up because of deduplication. There is no calculation or ratio in this number. It is the number of bytes (MB, GB, TB etc.) that have not changed since the last backup and therefore do not need to be backed up. This report shows only deduplication jobs. The default Report On parameter for this report is Master Server. This report is available in Historical, and Ranking views. The ranking report groups the total number of bytes saved per host being backed up to show the top five hosts in terms of deduplication savings.

The historical report shows the total number of bytes that do not need to be backed up because of deduplication. There is no calculation or ratio in this number. It is the number of bytes (MB, GB, TB etc.) that have not changed since the last backup and therefore do not need to be backed up. This report shows only deduplication jobs. The default Report On parameter for this report is Master Server

Figure: Deduplication Size Savings Report (Historical) shows a sample view of Deduplication Size Savings Report.

Figure: Deduplication Size Savings Report (Historical)

Deduplication Size Savings Report (Historical)

The ranking report shows the number of bytes that have not changed within the time frame and therefore do not have to be backed up due to deduplication. The report groups the total number of bytes saved per host being backed up to show the top five hosts in terms of deduplication savings. This report shows only deduplication jobs. The default Report On parameter for this report is Master Server.

Figure: Deduplication Size Savings Report (Ranking) shows a sample view of Deduplication Size Savings Report.

Figure: Deduplication Size Savings Report (Ranking)

Deduplication Size Savings Report (Ranking)

The following are the details that are associated with the Deduplication Size Savings report:

Report Name

Deduplication Size Savings

Report Category

Deduplication

Where to Locate

Report > Report Templates > Backup > Deduplication > Deduplication Size Savings.

Report Purpose

Historical trending report showing savings in storage due to deduplication.

Supported products

Symantec NetBackup, Symantec NetBackup Master Appliance, Symantec PureDisk

Applicable View

Master Server, Client

Report Display

Historical, Ranking

Drill down report

Yes. Drill down reports can be viewed at following four levels depending on the selected parameter value. Level 1: Client Name, Level 2: Job Type, Level 3: Schedule/Level Type, Level 4: Policy Name

Can be scheduled

Yes

Supported formats

PDF, CSV, HTML, XML, TSV

Mapping with VBR report

All Reports > Standard Report > Backups > Deduplication > Size Savings

You can edit the Deduplication Size Savings report to show or hide the columns that you are interested in. All the columns are shown by default. You can also filter the report to view reports of specific master servers. The following table shows the filters for Deduplication Size Savings report and how you can use these filters to view specific data in the report.

Filter

Column

Description

View

 

Select a view to show data for the selected view in the report. By default All My Views is selected.

Master Server

Master Server

Select one or more master servers to view data for the virtual clients that are backed up by the selected master servers in the report. All master servers that are added to the OpsCenter console are shown in the Value drop-down list .

Media Server

Media Server

Select one or more media servers to view data for the selected media servers in the report. All media servers that are added to the OpsCenter console are shown in the Value dropdown list.

Client

Client Name

Client OS

Select one or more clients to view data for the selected clients in the report. All clients that are protected by the master servers in the OpsCenter console are listed.

Select one or more client OS to view data for the selected client OS in the report.

Policy

Policy Name

Policy Type

Policy Keyword

Policy Data Classification Name

Select one or more policy names to view data for the selected policies.

Select one or more policy types to view data for the selected policy types.

Select one or more policy keyword to view data for the policies having the selected keyword.

Select one or more policy data classification names like Gold, Silver, Bronze etc. to view data for the policies having the selected data classiciation names.

Schedule

Schedule/Level Type

Schedule Name

Select one or more schedule or level types to view the data for the configured schedules.

Select one or more schedule names to view data for the selected schedule names.

Job

Job Type

Product Type

Job Transport Type

Job Status

Select one or more job types as Archive, Backup, or Restore to view data for the selected job types.

Select one or more backup products to view data for the selected products.

Select one or more transport types like FT, LAN, or Unknown to view data for the jobs that have the selected transport type.

Select one or more job status like Failed, Successful etc. to view data for the selected job status.

Example of filter usage: You may run the Duplication Size Savings report for a specific client to view details for the specific clients

The following report enhancements have been made for the Duplication Size Savings report:

  • Drill down reports for following four levels have been added.

    • Level 1: Client Name

    • Level 2: Job Type

    • Level 3: Schedule/Level Type

    • Level 4: Policy Name

  • The report now shows trendlines with customized average periods with option: Show trendline with moving average period of 3. The value can be 1, 2, 3, 7, 14, or 30. The default value is 3. Example: 3 displays a trendline that, at each interval, shows the average of the current data point and the two previous data points.

  • The unit Years has been added for Relative Timeframe selection. You can now view data for the last x years.

  • The option Next has been removed from the drop-down menu for Relative Timeframe. You now cannot enter a relative timeframe like Next 2 weeks. This is because the report does not support any futuristic data.

  • The following options have been removed from TimeFrameGroupBy selection:

    • Days of week Average

    • Days of week total

    • Hours of day Average

    • Hours of day Total

    • Minutes

    • Month of year Average

    • Month of year total

    • Quarter of year Average

    • Quarter of year total

The following filter enhancements have been made for the Duplication Size Savings report:

  • New columns Job Status (Complete Only) and Job Status(Complete and Partial Only) are added to the Report On parameter.

  • UNKNOWN has been added as a value for Policy Type column in the Policy filter. You can now filter report data for policy types that are UNKNOWN.

  • Job types which are not applicable for this report are removed from the filter. The supported job types are Backup, Archive, Restore, Catalog Backup, Backup from Snapshot, Snapshot, Duplicate, and Replication.

  • The default selected value for Job Type column is changed to Backup.

  • Symantec Enterprise Vault, IBM Tivoli Storage Manager,EMC NetWorker, Symantec Backup Exec,and CommVault Galaxy have been removed from the list of values for column Product Type under Job filter.

  • The default unit for Y1 axis is now GB.

  • Value SAN is removed from the Job Transport Type filter.

Known Issues:

  • Selecting Reporton Storage Unit name in OpsCenter does not result in any data for Symantec PureDisk because Storage Unit is not present in PureDisk .In VBR, jobs data collected from PureDisk puts the storage unit name incorrectly as "SPA".

  • As compared to VBR, the unit of data for Y axis/Y1 axis is one unit lower. For example: Unit GB in VBR matches up with unit MB in OpsCenter. OpsCenter shows the correct data.


Legacy ID



v69077906_v71396211


Article URL http://www.symantec.com/docs/HOWTO71900


Terms of use for this information are found in Legal Notices