bugfix: Fix schema meta-info serialization issue in HBaseRelation by yuanoOo · Pull Request #94 · oceanbase/spark-connector-oceanbase

yuanoOo · 2026-01-23T06:53:26Z

Summary

Problem: In a distributed Spark environment, the schema metadata (rowKey and columnFamilyMap) was stored in the HBaseRelation companion object. A Scala companion object is a per-JVM singleton (the equivalent of static state), so its fields are not serialized and shipped to executors along with the task. Each executor therefore initialized the object afresh, leaving these variables at their default (empty) values. As a result, flush() looked up the row key in the DataFrame schema using an empty field name and failed with java.lang.IllegalArgumentException: "" does not exist.
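The failure mode can be reproduced without Spark. The following is a minimal sketch, not the connector's code: LegacySchemaHolder, LegacyRelation and CompanionStateDemo are hypothetical names, plain Java serialization stands in for Spark's task shipping, and resetting the singleton's field by hand stands in for a freshly started executor JVM.

```scala
import java.io.{ByteArrayInputStream, ByteArrayOutputStream, ObjectInputStream, ObjectOutputStream}

// Hypothetical stand-in for the pre-fix layout: schema metadata lives in a singleton.
object LegacySchemaHolder {
  var rowKey: String = "" // the default value a freshly started JVM would see
}

// The relation instance carries no metadata of its own; it reads the singleton.
class LegacyRelation extends Serializable {
  def rowKeyName: String = LegacySchemaHolder.rowKey
}

object CompanionStateDemo {
  def serialize(value: AnyRef): Array[Byte] = {
    val buffer = new ByteArrayOutputStream()
    val out = new ObjectOutputStream(buffer)
    try out.writeObject(value) finally out.close()
    buffer.toByteArray
  }

  def deserialize[T](bytes: Array[Byte]): T = {
    val in = new ObjectInputStream(new ByteArrayInputStream(bytes))
    try in.readObject().asInstanceOf[T] finally in.close()
  }

  // Returns the row key name that the simulated executor ends up seeing.
  def rowKeySeenByExecutor(): String = {
    LegacySchemaHolder.rowKey = "id"          // "driver": catalog parsed, singleton populated
    val bytes = serialize(new LegacyRelation) // only the instance is written to the byte stream
    LegacySchemaHolder.rowKey = ""            // simulate a new JVM where the singleton starts empty
    deserialize[LegacyRelation](bytes).rowKeyName
  }
}
```

In this simulation the deserialized instance reports an empty row key, because the value set on the "driver" was never part of the serialized bytes.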

Fix: Refactored HBaseRelation to ensure proper serialization of schema metadata:

- Moved rowKey and columnFamilyMap from the companion object to immutable instance variables within the HBaseRelation class.
- Updated parseCatalog to return a tuple containing the StructType, rowKey, and columnFamilyMap.
- Updated the flush method to use these instance variables, ensuring that schema mapping information is correctly available on all executors.
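The shape of the fix can be sketched as follows. This is an illustration under stated assumptions rather than the merged code: Column, RelationAfter and RoundTrip are hypothetical names, a Seq[String] of field names stands in for the Spark StructType, and marking the row key with the family name "rowkey" is an assumed convention, since the PR text does not show the catalog format.

```scala
import java.io.{ByteArrayInputStream, ByteArrayOutputStream, ObjectInputStream, ObjectOutputStream}

// Hypothetical, simplified column description (the real code parses a catalog into a StructType).
final case class Column(name: String, family: String)

object RelationAfter {
  // Returns schema, row key and column-family map together as a tuple,
  // instead of writing them into mutable fields of this companion object.
  def parseCatalog(columns: Seq[Column]): (Seq[String], String, Map[String, String]) = {
    val rowKey = columns
      .find(_.family == "rowkey") // assumed convention for marking the row key column
      .map(_.name)
      .getOrElse(throw new IllegalArgumentException("catalog has no row key column"))
    val families = columns.filterNot(_.family == "rowkey").map(c => c.name -> c.family).toMap
    (columns.map(_.name), rowKey, families)
  }
}

// The metadata is held in immutable instance fields, so it is serialized with the instance.
class RelationAfter(columns: Seq[Column]) extends Serializable {
  val (schema, rowKey, columnFamilyMap) = RelationAfter.parseCatalog(columns)
}

object RoundTrip {
  // Serializes and deserializes a value, standing in for shipping it to an executor.
  def apply[T](value: T): T = {
    val buffer = new ByteArrayOutputStream()
    val out = new ObjectOutputStream(buffer)
    try out.writeObject(value) finally out.close()
    val in = new ObjectInputStream(new ByteArrayInputStream(buffer.toByteArray))
    try in.readObject().asInstanceOf[T] finally in.close()
  }
}
```

Because rowKey and columnFamilyMap are now part of the instance's own state, a copy produced by RoundTrip(new RelationAfter(...)) carries the same values as the original, which is the property flush() relies on when it runs on an executor.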

fix #91

davidzhangbj approved these changes Jan 26, 2026

Merge branch 'main' into fix-obkv

edf4fe3

yuanoOo merged commit 8f0f384 into oceanbase:main Jan 26, 2026
10 checks passed

yuanoOo merged 2 commits into oceanbase:main from yuanoOo:fix-obkv
