Inconsistent Error Handling in `tf.raw_ops.SparseSegmentSumGradV2` Between CPU and GPU Implementations #94151

SilentTester73 · 2025-05-26T16:54:29Z

Issue type

Bug

Have you reproduced the bug with TensorFlow Nightly?

Yes

Source

binary

TensorFlow version

tf 2.20.0-dev20250526

Custom code

Yes

OS platform and distribution

Linux CPU & GPU

Mobile device

No response

Python version

3.13.2

Bazel version

No response

GCC/compiler version

No response

CUDA/cuDNN version

CUDA 12.5.1

GPU model and memory

No response

Current behavior?

The tf.raw_ops.SparseSegmentSumGradV2 function exhibits inconsistent behavior between CPU and GPU implementations.

When given an out-of-range negative index, the CPU version raises an exception as expected, while the GPU version silently accepts the negative index or aborts.

Standalone code to reproduce the issue

import tensorflow as tf

# Run the same out-of-range inputs on each device and report whether the op raises.
for device, label in (('/cpu:0', 'CPU'), ('/gpu:0', 'GPU')):
    with tf.device(device):
        try:
            grad = tf.constant([0, 0, 0, 0], dtype=tf.float64, shape=[4])
            # -1 is outside the valid range [0, dense_output_dim0).
            indices = tf.constant([-1], dtype=tf.int64, shape=[1])
            segment_ids = tf.constant([-1], dtype=tf.int64, shape=[1])
            dense_output_dim0 = tf.constant([1], dtype=tf.int32, shape=[])

            tf.raw_ops.SparseSegmentSumGradV2(
                grad=grad,
                indices=indices,
                segment_ids=segment_ids,
                dense_output_dim0=dense_output_dim0,
                name=None
            )
            print(f"SparseSegmentSumGradV2 executed successfully on {label}")
        except Exception as e:
            print(f"Exception on {label}: {e}")

Relevant log output

Output:


I0000 00:00:1748278378.253343  624834 gpu_device.cc:2018] Created device /job:localhost/replica:0/task:0/device:GPU:0 with 2033 MB memory:  -> device: 0, name: NVIDIA GeForce RTX 4050 Laptop GPU, pci bus id: 0000:01:00.0, compute capability: 8.9
2025-05-27 00:52:58.280430: I tensorflow/core/framework/local_rendezvous.cc:407] Local rendezvous is aborting with status: INVALID_ARGUMENT: Index -1 out of range [0, 1).
Exception on CPU: {{function_node __wrapped__SparseSegmentSumGradV2_device_/job:localhost/replica:0/task:0/device:CPU:0}} Index -1 out of range [0, 1). [Op:SparseSegmentSumGradV2] name: 
SparseSegmentSumGradV2 executed successfully on GPU

sharktide · 2025-05-26T18:07:29Z

We could probably add a check for negative indices before continuing with the job. To me, this seems like a we-overlooked-this kinda issue.
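As a rough illustration of the kind of check suggested above, here is a minimal caller-side sketch in plain Python. The helper name `check_sparse_segment_grad_args` and its parameters are hypothetical and not part of TensorFlow; it is not the kernel's actual validation, and the error text is only modelled on the CPU message in the log above. It assumes the inputs have already been converted to Python integers (for example via `tensor.numpy().tolist()`), and that segment ids are meant to lie in `[0, num_segments)`, where `num_segments` is the first dimension of `grad`.

```python
def check_sparse_segment_grad_args(indices, segment_ids, num_segments, dense_output_dim0):
    """Raise ValueError if any index or segment id is out of range.

    Hypothetical helper for illustration only; not a TensorFlow API.
    """
    indices = list(indices)
    segment_ids = list(segment_ids)

    # The op expects one segment id per index.
    if len(indices) != len(segment_ids):
        raise ValueError(
            f"indices and segment_ids must have the same length, "
            f"got {len(indices)} and {len(segment_ids)}."
        )

    # Reject negative or too-large indices into the dense output.
    for i in indices:
        if not 0 <= i < dense_output_dim0:
            raise ValueError(f"Index {i} out of range [0, {dense_output_dim0}).")

    # Reject negative or too-large segment ids (assumed range: rows of grad).
    for s in segment_ids:
        if not 0 <= s < num_segments:
            raise ValueError(f"Segment id {s} out of range [0, {num_segments}).")
```

Calling this before the op on either device, e.g. `check_sparse_segment_grad_args([-1], [-1], 4, 1)` for the inputs in the reproduction, would raise on both CPU and GPU paths. A real fix would presumably live in the GPU kernel rather than in user code.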

Venkat6871 · 2025-05-29T12:47:57Z

I was able to reproduce the same issue using TensorFlow 2.19.0 as well as the nightly version. Please find the gist for your reference.
Thank you!

google-ml-butler bot added the type:bug Bug label May 26, 2025

google-ml-butler bot assigned tilakrayal May 26, 2025

Venkat6871 added comp:ops OPs related issues TF 2.19 labels May 26, 2025

Venkat6871 assigned Venkat6871 and unassigned tilakrayal May 26, 2025

SilentTester73 mentioned this issue May 28, 2025

Inconsistent Error Handling in tf.raw_ops.SparseSegmentSqrtNGradV2 Between CPU and GPU Implementations #94376

Open
