Bugfix against retry logic #576
Conversation
After adding the instant retry amount logic to the code, this line of code could cause the transmissions to never back off.
{
    long backOffSeconds = backOffMillis / 1000;
    InternalLogger.INSTANCE.info("App is throttled, telemetry will be blocked for %s seconds.", backOffSeconds);
    InternalLogger.INSTANCE.logAlways(InternalLogger.LoggingLevel.TRACE, "App is throttled, telemetry will be blocked for %s seconds.", backOffSeconds);
Will this flood the log and make users unhappy?
Well, I think it has more consequences than just making users unhappy. Writing logs to disk (i.e. a file) or to the console is expensive for the application, and during peak load there can be some throttling in sending telemetry; the extensive logging can then slow a production application down and lead to missing important transactions (say, in a high-throughput e-commerce platform). This should not be logAlways.
I agree with @dhaval24, this shouldn't be logAlways. I would go with debug.
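For illustration, a minimal sketch of the suggested change, reusing the exact InternalLogger call already shown in the hunk above (whether the final level ends up as info or debug is still the reviewers' call):

```java
// Sketch: demote the throttling message from logAlways(TRACE) to a level that
// users can filter out; the info(...) overload is the one shown in the diff above.
long backOffSeconds = backOffMillis / 1000;
InternalLogger.INSTANCE.info("App is throttled, telemetry will be blocked for %s seconds.", backOffSeconds);
```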
Still reviewing...
Previous review was for wrong PR. Sorry. Still reviewing this...
@@ -189,10 +189,8 @@ public boolean send(Transmission transmission) {
    respString = EntityUtils.toString(respEntity);
    retryAfterHeader = response.getFirstHeader(RESPONSE_THROTTLING_HEADER);

    // After the third time through this dispatcher we should reset the counter and
    // then fail to second TransmissionOutput
    if (code > HttpStatus.SC_PARTIAL_CONTENT && transmission.getNumberOfSends() >= MAX_RESEND) {
The variable MAX_RESEND is no longer needed
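For context, a rough sketch of what the check could look like once the hard-coded MAX_RESEND constant is removed; the name maxInstantRetries is an assumption based on the setter discussed further down, not the actual field in the PR:

```java
// Hypothetical shape of the updated check: the configurable instant-retry count
// takes the place of the removed MAX_RESEND constant.
if (code > HttpStatus.SC_PARTIAL_CONTENT && transmission.getNumberOfSends() >= maxInstantRetries) {
    // stop instant retries here and fall back to the second TransmissionOutput / backoff path
}
```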
@debugthings I have a few recommendations on this one, if you can take a look.
        return;
    }

    Date date = Calendar.getInstance().getTime();
    date.setTime(date.getTime() + 1000 * suspendInSeconds);
Can we add explicit parentheses here: date.setTime(date.getTime() + (1000 * suspendInSeconds))? I know that multiplication takes precedence over addition, but I would still prefer to be explicit to avoid any mistakes.
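Applied to the hunk above, the suggestion would read as follows (same behavior, parentheses added only for readability):

```java
Date date = Calendar.getInstance().getTime();
date.setTime(date.getTime() + (1000 * suspendInSeconds));
```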
 * Set the number of retries before performing a back off operation.
 * @param maxInstantRetries Number of retries
 */
public void setMaxInstantRetries(int maxInstantRetries) {
I think we should always keep the minimum instant retries at 3. Allowing instant retries to be set to 0 will again invite similar trouble on constrained networks. I would suggest that we change the condition to reflect that.
instantRetries defaults to 3 if it is not set with this method; I changed the logic to only apply the value if it is in the range [1..10]. I agree that 0 would put us into a condition where we back off too soon, but I disagree that we should always keep this at 3 and give no option to lower it.
Good to go.
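A minimal sketch of the range guard described in this thread; only the default of 3 and the accepted range [1..10] come from the discussion, and the field name instantRetryAmount is a placeholder, not the actual name in the PR:

```java
/**
 * Set the number of retries before performing a back off operation.
 * @param maxInstantRetries Number of retries
 */
public void setMaxInstantRetries(int maxInstantRetries) {
    // Accept only values in [1..10]; out-of-range values are ignored and the
    // default of 3 stays in effect. The field name below is hypothetical.
    if (maxInstantRetries >= 1 && maxInstantRetries <= 10) {
        this.instantRetryAmount = maxInstantRetries;
    }
}
```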
* Fix null ref check in telemetry correlation Utils (#541)
* Fix null ref check in TelemetryCorrelationUtils
* Modifying log level to warning
* Updating Changelog
* Fix handling of NaN and +/-Infinity in JSON serializer (#499)
* Handle NaN and +/-Infinity in metrics
* Default NaN/Infinity serialization to 0 to be consistent with other AI SDKs and make the code compatible with Java 6
* Fixed javadoc errors and added section to generate pom.xml with all builds (#551)
* Updating version number to 2.0.0
* Implementing Retry logic [Reliable Channel] [STABLE Branch] (#561)
* Initial commit of retry and backoff logic fixes
* Fixing warnings on files I touched this round
* Fix the Eclipse UI from screaming about the docker Constants
* Fixed backoff logic to use existing method. Added more logging to the sender channel.
* Added the partial response handler, more logging
* Added gson to core. Fixed backoff manager to keep original functionality. Added extension to return the timeout values as expected before.
* Added unit tests.
* Fixing string typed ArrayList<> to List<> per Dhaval
* Missed one
* Making tests consistent.
* Added javadoc comments, simplified logic for a few methods
* Added exception logging per @dhaval24. Fixed formatting on touched files
* Updates per last round of commits: moved the Handlers out of the concrete package to the common package to keep the same consistency. Removed a couple of unnecessary methods. Added docs.
* Latest fixes
* Add MaxInstantRetry: added MaxInstantRetry configuration to allow for instantaneous retry on a failed transmission.
* Javadoc updates: javadoc and formatting updates
* NumberFormatException fix: added null check
* JavaDocs for TPM
* Fixing FixedRateSampling to work in old and new versions of sampling (#540): overriding the default sampling percentage when a sampling percentage is programmatically specified by the user.
* Upgrade to logback v1.2.3 (#565)
* Reliable channel: replacing logAlways "TRACE" statements with "info" (#571)
* Reliable channel: close resources in finally block. (#572)
* Reliable channel: close resources in finally block.
* Change logging to warning when closing resources
* Bugfix against retry logic (#576)
* Refactor
* BUGFIX: logic would never back off. After adding the instant retry amount logic to the code, this line of code could cause the transmissions to not back off.
* Changes requested
* Fixed javadoc tags that caused build errors while executing `javadoc` gradle task (#578)
* Update Changelog
* Fix link in changelog
* Fix another link in changelog
* Update gradle.properties
* Fix customizing pom.xml in Gradle build (#582)
* Fix customizing pom.xml in Gradle build
* Insert license after first row in pom.xml
* Filter artifacts relocated by shadow task from pom dependencies: match artifacts by groupId; fixes #583
* Generate a pom file "beside" the artifact jar file
When reviewing this code to implement a few refactored items, I noticed that there was still a hard-coded retry value in the logic. This PR fixes that.
As it stands, there is still a mix of tabs and spaces in the files, which causes some additional changed lines in the commit that don't need to be there.