Comments (5)
@artem-garmash thank you for reporting this. Sorry for the late reply! We will take a look at this issue.
from pulsar-io-cloud-storage.
The error is because we can't get the Message instance, because the Source in Function is not a topic message, but postgresql, and cloud-storage expects to get the PulsarRecord message.
During the run, if I use only Record data and not PulsarRecord data, I will not be able to process Schema. So I need to use Pulsar topic as source.
from pulsar-io-cloud-storage.
@artem-garmash pulsar-io-cloud-storage needs to use a schema to hold structural information about the data, so it can only use the Pulsar topic as message input. You need to write your records to the topic and then use pulsar-io-cloud-storage to write them out.
There was sensitive information in the log and I removed it.
from pulsar-io-cloud-storage.
@jianyun8023, thanks for the reply. My pulsar topic had schema definition as you can see in the bug description. I literally followed README instructions 1:1 to test pulsar-io-cloud-storage but it didn't work.
At the end of my attempts I tried to use jdbc connector with postgres sink instead of pulsar-io-cloud-storage just to verify that my topic/setup worked. And it all worked with jdbc connector, consuming the same pulsar topic.
from pulsar-io-cloud-storage.
I cannot reproduce this issue with the latest version, so I assume this issue has been resolved in past commits. Feel free to re-open it if there is any more context.
from pulsar-io-cloud-storage.
Related Issues (20)
- [FEATURE] support aws native SDK
- [FEATURE] should write schema version in a human-readable format HOT 2
- [BUG] Pulsar cloud storage connector can't convert PROTO_NATIVE schema to Parquet schema HOT 3
- [BUG] Upgrade hadoop common to 3.3.3 to solve CVE-2022-26612
- [BUG] extract schema info from messages without schema version
- [BUG] Time Partitioner not working as expected for GCS HOT 1
- [BUG] failed to extract schema version for AVRO format and causing cast error
- [BUG] error while writing data in Parquet format using JsonSchema as schema format HOT 3
- [FEATURE] Add a new batch triggering mechanism that is based on the batch size
- [FEATURE] Pending Queuesize default to the maxbatch size
- [BUG] Connector does not create output files for each topic when reading multiple topics
- [FEATURE] Azure Blob Storage support
- [BUG] PR template should correctly add doc labels based on description
- [BUG] formatType not working as expected HOT 1
- [BUG] Sink flushes only when batchTimeMs is exceeded for large batchSize
- [BUG] Files generated per topic have fluctuating size instead of `batchSize`
- [BUG] `maxTimeoutMs` should apply at the topic level
- Update master to JDK17 HOT 1
- [BUG] NPE if partitionerType is not set
- [BUG] The connector covert array json bytes failed.
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pulsar-io-cloud-storage.