Merge PR #11 from beta branch to main #24

Draft
wants to merge 148 commits into
base: main

StrongestNumber9 (Contributor) commented Jul 4, 2024

This is still a work in progress and should not be reviewed at this point.

Original comments and PR #11

…n and HDFS storage (WIP). Skipping implementation of the idempotent consumer pattern for the Kafka consumer groups.
…-serialization in Kafka consumer processing.
…ad of definite values as definite requires continuous use of flush() after appending records to AVRO.
…n files with flush() after every record append. Improved test cases.
…erformance. Cleaned up naming in DurationStatistics. Debugging faulty ordering of consumed Kafka topic partitions.
… gets its own exclusive AVRO-file queueNamePrefix which is used to generate the AVRO-files where the records of the partition are stored.
@StrongestNumber9 StrongestNumber9 marked this pull request as draft July 4, 2024 09:06
Tiihott and others added 13 commits July 4, 2024 12:12
Fix to rpm packaging configuration
* Fix for issue #32 by enabling automatic TGT renewal.

* Replaced usage of the Hadoop Configuration class with its child class HdfsConfiguration.

* Added simplified handling for consumed Kafka records with null content.

* Added additional null content records to MockKafkaConsumerFactory for tests.

* Added exception handling for non-parseable records. Modified test data and tests accordingly.

* Moved NULL record handling from KafkaReader to DatabaseOutput. Changed KafkaConsumerTest accordingly.

* Added configuration flags for skipping empty and non-RFC5424 records. Improved logging to include metadata for the records that cause exceptions.

* Added configuration flags for automatic Kerberos TGT renewal.

* Added separate record processing tests. Added cleanup of temporary files that weren't written to HDFS because of an exception during processing.

* Added checkTGTAndReloginFromKeytab() functionality.

---------

Co-authored-by: Mikko Kortelainen <kortemik@users.noreply.github.com>
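
For readers unfamiliar with the skip flags mentioned in the commit message above, the following is a minimal sketch of how such flags could gate record handling. It is not the code from this PR: the class name `RecordFilterSketch`, the flag names `skipEmpty` and `skipNonRfc5424`, and the `accept` method are all hypothetical, and the regular expression is only a simplified stand-in for a real RFC 5424 parser (it checks the `<PRI>VERSION ` prefix and nothing else).

```java
import java.nio.charset.StandardCharsets;
import java.util.regex.Pattern;

// Hypothetical illustration only; none of these names come from the PR.
final class RecordFilterSketch {
    // Simplified stand-in for an RFC 5424 parser: matches only the "<PRI>VERSION " prefix.
    private static final Pattern RFC5424_PREFIX = Pattern.compile("^<\\d{1,3}>[1-9]\\d{0,2} ");

    private final boolean skipEmpty;       // assumed flag: skip null or empty records
    private final boolean skipNonRfc5424;  // assumed flag: skip records that fail the prefix check

    RecordFilterSketch(boolean skipEmpty, boolean skipNonRfc5424) {
        this.skipEmpty = skipEmpty;
        this.skipNonRfc5424 = skipNonRfc5424;
    }

    /**
     * Returns true if the record should be stored and false if it is skipped.
     * Throws when the record is unusable and the matching skip flag is off,
     * including the record metadata in the exception message.
     */
    boolean accept(String topic, int partition, long offset, byte[] value) {
        String metadata = "topic=" + topic + ", partition=" + partition + ", offset=" + offset;
        if (value == null || value.length == 0) {
            if (skipEmpty) {
                return false;
            }
            throw new IllegalArgumentException("Empty record at " + metadata);
        }
        String text = new String(value, StandardCharsets.UTF_8);
        if (!RFC5424_PREFIX.matcher(text).find()) {
            if (skipNonRfc5424) {
                return false;
            }
            throw new IllegalArgumentException("Non-RFC5424 record at " + metadata);
        }
        return true;
    }
}
```

With both flags enabled, bad records are dropped silently; with both disabled, the first bad record raises an exception whose message identifies the topic, partition, and offset of the offending record.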
kortemik (Member) commented

see #35 (review)

kortemik and others added 2 commits August 20, 2024 15:18
* Added encryption flags for configuration
* Added missing flags to example/test configuration
3 participants