Releases: redpanda-data/connect
v4.90.2
For installation instructions check out the getting started guide.
Added
- mysql_cdc: MySQL CDC connector now supports parallel snapshots to improve initial data capture performance when replicating large tables. (@josephwoodward, #4363)
- bigquery: Added storage write API connector. (@squiidz, #4220)
Fixed
- iceberg: Fix schema column ordering (@josephwoodward, #4373)
The full change log can be found here.
v4.90.1
For installation instructions check out the getting started guide.
Added
- mysql_cdc: MySQL CDC connector now supports parallel snapshots to improve initial data capture performance when replicating large tables. (@josephwoodward, #4363)
- bigquery: Added storage write API connector. (@squiidz, #4220)
Fixed
- iceberg: Fix schema column ordering (@josephwoodward, #4373)
The full change log can be found here.
v4.90.0
For installation instructions check out the getting started guide.
Added
- mysql_cdc: MySQL CDC connector now supports parallel snapshots to improve initial data capture performance when replicating large tables. (@josephwoodward, #4363)
- bigquery: Added storage write API connector. (@squiidz, #4220)
Fixed
- iceberg: Fix schema column ordering (@josephwoodward, #4373)
The full change log can be found here.
v4.89.3
For installation instructions check out the getting started guide.
Fixed
- iceberg: fix decimal min/max stats extraction for parquet files (@josephwoodward, #4368)
Added
- iceberg: Added a
case_sensitive_columnsfield to theicebergoutput. Whenfalse, column-name matching follows iceberg's recommended case-insensitive convention end-to-end (shredding, schema evolution, partition specs,schema_metadata). Defaults totruefor backwards compatibility. (@Jeffail)
The full change log can be found here.
v4.89.2
For installation instructions check out the getting started guide.
Fixed
- oracledb_cdc: Ensure LogMiner nullable numbers are consistent with snapshotting. (@josephwoodward, #4354)
Changed
v4.89.1
For installation instructions check out the getting started guide.
Fixed
- iceberg: Updated
iceberg-godependency that includes fix to seed UpdateSchema id counter from metadata LastColumnID. (@josephwoodward, #4343)
v4.89.0
For installation instructions check out the getting started guide.
Added
- general: PostgreSQL CDC benchmarking suite added with Kafka Connect PostgreSQL benchmark infrastructure and configuration. (@ness-david-dedu, #4216)
- parquet_encode: Added configurable timestamp unit support (nanosecond, microsecond, millisecond) to make Parquet output readable by Apache Spark, Databricks, AWS Athena, and DuckDB. (@ankit481, #4294)
- oracledb_cdc: Added
transaction_idto message metadata. (@josephwoodward, #4328) - oracledb_cdc: Added
commit_ts_msto message metadata. (@josephwoodward, #4331)
Fixed
- confluent: Avro schema reference resolution now handles arbitrary schema shapes and correctly inlines transitive references, fixing misleading errors and missing nested reference resolution. (@twmb, #4247)
- mysql_cdc: IAM token refresh and canal recreation before streaming prevents connection failures when snapshots delay binlog streaming. (@josephwoodward, #4295)
- oracledb_cdc: Oracle numeric values with missing leading zeros (e.g., '.5') are now normalized to valid JSON format for proper CDC streaming. (@josephwoodward, #4322)
Unreleased
Added
- parquet_encode: Added
default_timestamp_unitfield (valuesNANOSECOND,MICROSECOND,MILLISECOND) controlling the precision of TIMESTAMP logical types. Default remainsNANOSECONDfor backwards compatibility. UseMICROSECONDwhen writing files for Apache Spark/Databricks, AWS Athena or DuckDB, which do not supportTIMESTAMP(NANOS). (#3570)
The full change log can be found here.
v4.89.0-rc1
For installation instructions check out the getting started guide.
Added
- mysql_cdc: Added integration tests validating MySQL CDC connection drop and reconnection scenarios with zero data loss. (@mmatczuk, #4241)
- oracledb_cdc: Added multi-tenant support for Oracle CDC input to handle both container and non-container databases. (@josephwoodward, #4237)
- oracledb_cdc: Added source timestamp metadata to Oracle CDC messages to propagate the redo log timestamp to consumers. (@josephwoodward, #4250)
- otlp: Added OpenTelemetry metrics exporter supporting both gRPC and HTTP protocols. (@mmatczuk, #4230)
- arc: add output plugin for Arc columnar database (@xe-nvdk, #4265)
Fixed
- amqp1: Fixed data race in randomString by replacing non-goroutine-safe package-level rand with thread-safe top-level rand.Intn. (@mmatczuk, #4260)
- general: Fixed Docker latest tag for connect image from incorrect latest-cloud to latest. (@mmatczuk, #4267)
- mysql_cdc: Fixed snapshot column order consistency by using deterministic primary key slice iteration instead of non-deterministic map iteration. (@josephwoodward, #4262)
- postgresql: Fixed handling of tsvector type in PostgreSQL replication decoder to return raw PostgreSQL text representation. (@mmatczuk, #4261)
- protobuf: Fixed memory leak in hyperpb parser caused by pool holding unused messages and added profile-guided optimization support. (@mmatczuk, #4240)
Changed
- iceberg: Updated Iceberg fallback behavior to infer column type when schema doesn't contain the field, enabling support for transforms. (@rockwotj, #4263)
- oracledb_cdc: Switched from buffering all redo events to streaming them through a callback to reduce memory allocation. (@josephwoodward, #4243)
- mysql_cdc: write checkpoint after snapshot (@josephwoodward, #4269)
The full change log can be found here.
v4.88.0
For installation instructions check out the getting started guide.
Added
- mysql_cdc: Added integration tests validating MySQL CDC connection drop and reconnection scenarios with zero data loss. (@mmatczuk, #4241)
- oracledb_cdc: Added multi-tenant support for Oracle CDC input to handle both container and non-container databases. (@josephwoodward, #4237)
- oracledb_cdc: Added source timestamp metadata to Oracle CDC messages to propagate the redo log timestamp to consumers. (@josephwoodward, #4250)
- otlp: Added OpenTelemetry metrics exporter supporting both gRPC and HTTP protocols. (@mmatczuk, #4230)
- arc: add output plugin for Arc columnar database (@xe-nvdk, #4265)
Fixed
- amqp1: Fixed data race in randomString by replacing non-goroutine-safe package-level rand with thread-safe top-level rand.Intn. (@mmatczuk, #4260)
- general: Fixed Docker latest tag for connect image from incorrect latest-cloud to latest. (@mmatczuk, #4267)
- mysql_cdc: Fixed snapshot column order consistency by using deterministic primary key slice iteration instead of non-deterministic map iteration. (@josephwoodward, #4262)
- postgresql: Fixed handling of tsvector type in PostgreSQL replication decoder to return raw PostgreSQL text representation. (@mmatczuk, #4261)
- protobuf: Fixed memory leak in hyperpb parser caused by pool holding unused messages and added profile-guided optimization support. (@mmatczuk, #4240)
Changed
- iceberg: Updated Iceberg fallback behavior to infer column type when schema doesn't contain the field, enabling support for transforms. (@rockwotj, #4263)
- oracledb_cdc: Switched from buffering all redo events to streaming them through a callback to reduce memory allocation. (@josephwoodward, #4243)
- mysql_cdc: write checkpoint after snapshot (@josephwoodward, #4269)
The full change log can be found here.
v4.88.0-rc3
For installation instructions check out the getting started guide.
Added
oracledb_cdc: add source_ts_ms metadata - (@josephwoodward, #4250)
The full change log can be found here.