KB ID: | 1681 |
Product: | Veeam Backup & Replication | 9.5 | 10 | 11 | 12 | 12.1 | 12.2 | 12.3 | 12.3.1 |
Published: | 2012-10-04 |
Last Modified: | 2025-03-11 |
Languages: | JP |
Now you’re less likely to miss what’s been brewing in our knowledge base with this weekly digest
Please, try again later.
Veeam does not remove the snapshot itself, Veeam sends an API call to VMware to have the action performed.
The snapshot removal process significantly lowers the total IOPS that can be delivered by the VM because of additional locks on the VMFS storage due to the increase in metadata updates, as well as the added IOP load of the snapshot removal process itself. In most environments, if you are already over 30-40% IOP load for your target storage, which is not uncommon with a busy SQL/Exchange server, then the snapshot removal process will easily push that into the 80%+ mark and likely much higher. Most storage arrays will see a significant latency penalty once IOP's get into the 80%+ mark which will of course be detrimental to application performance.
The following test should be performed when connectivity to the VM is not sensitive, for instance, during off-peak hours.
To isolate the VMware snapshot removal event, Veeam suggests the following isolation test:
While performing the test above, if you observe the same connectivity issues as during the Veeam job run, the issue likely exists within the VMware environment itself. Review the following list of troubleshooting steps and known issues. If none of the following work to resolve the issue, we advise that you contact VMware support directly regarding the snapshot removal issue.
The VMware article regarding this issue has been updated to indicate that the underlying issue causing the snapshot stun issue with NFS 3.0 and Virtual Appliance (HotAdd) transport mode was resolved in the VMware ESXi 8.0 Update 2b release.
Symptom: When the backup job completes and the snapshot removal request is processed, the Guest OS of the VM that was processed will experience multiple minutes of stun. However, isolation testing with manual snapshot creation and removal will only result in mere seconds of stun.
In ESXi versions prior to 8.0 U2b, there is a known issue with NFS 3.0 Datastores and Virtual Appliance (HotAdd) transport mode. The issue is documented in Broadcom KB323118: Virtual machines residing on NFS storage become unresponsive during a snapshot removal operation:
"This issue occurs when the target virtual machine and the backup appliance [proxy] reside on two different hosts, and the NFSv3 protocol is used to mount NFS datastores. A limitation in the NFSv3 locking method causes a lock timeout, which pauses the virtual machine being backed up [during snapshot removal]."
The recommended solution is to utilize Direct NFS Access transport mode, which will both improve backup performance and eliminate the risk of snapshot stun associated with NFS 3.0.
By default, the VMware Backup Proxy, when set to automatic transport mode selection, will attempt to utilize transport modes in the following priority:
Therefore, if the VMware Backup Proxy is attempting to utilize Virtual Appliance (HotAdd) to process a disk on an NFS-backed Datastore, this would indicate that environmental configuration has prevented Direct NFS Access from being selected. Review the following:
NFS client
package installed.nfs-common
package installed.nfs-utils
package installed.If implementing the Direct NFS Access transport mode is not feasible, either of the following workarounds may be used to alter the backup transport usage to work around the NFS 3.0 limitation. Both workarounds will result in functional backups but will not provide the simplicity of use or performance of correctly configuring Direct NFS Access transport mode.
Note: This workaround involves having the Veeam Backup & Replication software detect which host the VM that will be backed up is located on and assign the VMware Backup Proxy on that same host to perform disk processing.
This workaround restricts the transport mode selection to Network Mode, which can severely impact backup performance in environments where 10Gb networking is not in use. The vSphere environment assigns a lower priority to Network Mode traffic, and in environments using 1Gb networking, this can result in Network Mode traffic being throttled significantly.
Your feedback has been received and will be reviewed.
Please, try again later.
Please try select less.
This form is only for KB Feedback/Suggestions, if you need help with the software open a support case
Your feedback has been received and will be reviewed.
Please, try again later.