OpenSnowcat-rdb-loader and constant warnings about potential CPU starvation #9
Replies: 2 comments 2 replies
@toni-loisko we are just doing minor updates on the RDB loader, but let me investigate how we built our images. We are migrating everything from opensnowcat-rdb-loader to our Snowflake Streaming Loader, and over time opensnowcat-rdb-loader will be discontinued. We are simplifying how data is loaded and need to keep the project sustainable, so we have created a commercial loader that supports Snowflake, Elastic 9, Redshift, Azure, BigQuery, etc. I would be happy to chat. I should post more insights on the build today.
@toni-loisko we have a similar setup and also see the same message. I've raised this with the person who maintains rdb loader and I'll see what he says. BTW, are you using OpenSnowcat at nebenan.de? We would love to display your logo on opensnowcat.io!
Hello,
We're currently running OpenSnowcat-rdb-loader to load data from S3 into Snowflake in our staging environment. Since neither jar files nor an official container image are shipped for it, we built it ourselves with
sbt snowflakeLoader/docker:stage
and
docker buildx build --platform linux/arm64 -t snowcat-rdb-loader:latest .
(for whatever reason, sbt snowflakeLoader/docker:publishLocal did not produce a working container image for our environment, so extra steps it is).
With OpenSnowcat-rdb-loader we're seeing massive amounts of warnings like this:
And on the Snowflake side we're seeing the following message, which I've been told happens between bulk ingests (in one instance, 86 of these messages were counted):
alter session /* JDBC:SnowflakeConnectionV1.setAutoCommit*/ set autocommit= true|false
The running environment is AWS EKS, and the instances are ARM Graviton 2, 3 and 4 (I think specifically c6gd, c7gd and c8gd). Rdb-loader gets the following env variable:
JAVA_TOOL_OPTIONS: "-Xms256m -Xmx2g -XX:+UseSerialGC -XX:ActiveProcessorCount=1 --add-opens=java.base/java.nio=ALL-UNNAMED"
(The --add-opens=java.base/java.nio=ALL-UNNAMED option was added with OpenSnowcat-rdb-loader because it wouldn't start otherwise. It wasn't necessary when I was testing from my workstation, but as soon as we pushed the image to ECR and ran it on EKS, the application would crash on start. I found that strange: if I read correctly, this should only be needed for JDK 16 and later, and I think the container image is set up with JDK 11.)
Config file being:
I'm pretty sure I was informed that data ingestion is still actually happening and that the data team can see data in Snowflake, but I'll get verification for that later, likely tomorrow.
So I'm wondering if the warnings and the set autocommit spam are something to worry about. Is this version just more vocal about things than the old rdb-loader? We saw none of that before we started testing OpenSnowcat-rdb-loader. Looking at the capacity allocation, both CPU and memory should be fine: neither is starved or even close, and I didn't see anything strange I/O-wise on the actual hosts either.
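Since the JAVA_TOOL_OPTIONS value above includes -XX:ActiveProcessorCount=1, one thing that may be worth checking is how many processors the JVM inside the container actually reports, independent of what the host has available. The following is a minimal sketch for that check. The class name JvmView is hypothetical and not part of OpenSnowcat-rdb-loader, and whether the single-processor setting has anything to do with the warnings is an assumption, not something confirmed in this thread.

```java
// Hypothetical diagnostic helper; not part of OpenSnowcat-rdb-loader.
// Prints the processor count and maximum heap size as seen by the JVM,
// which reflect flags such as -XX:ActiveProcessorCount and -Xmx.
class JvmView {

    // Number of processors the JVM believes it may use.
    static int availableProcessors() {
        return Runtime.getRuntime().availableProcessors();
    }

    // Maximum heap the JVM will attempt to use, in MiB.
    static long maxHeapMiB() {
        return Runtime.getRuntime().maxMemory() / (1024L * 1024L);
    }

    // One-line summary suitable for a log line.
    static String describe() {
        return "availableProcessors=" + availableProcessors()
                + " maxHeapMiB=" + maxHeapMiB();
    }

    public static void main(String[] args) {
        System.out.println(describe());
    }
}
```

On JDK 11 or later this can be run directly as a single source file (java JvmView.java) in the same container and with the same JAVA_TOOL_OPTIONS. With -XX:ActiveProcessorCount=1 set, the JVM reports one available processor regardless of how many cores the Graviton instance has, so any library that sizes its thread pools from that number will see a single CPU even when the host itself is not under pressure.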