Make log handlers configurable, shorten entries #378

borzunov · 2021-09-04T03:38:42Z

Preliminaries

The current implementation of hivemind.utils.logging has several problems (they are not related to the colored logging and exist for a long time):

Bugs: Sometimes, one message is logged multiple times. This due to the combination of the bugs:
- If name stays the same, logging.getLogger(name) always returns the same logger instance. We expect the same behavior from hivemind's get_logger(name), however it adds a new logging.StreamHandler to the logger instance every time. If it is called N times, you will have N stream handlers and each message will be repeated N times.
- hivemind's get_logger(name) trims the first item in the module path. While this is intended to make the log line shorter by trimming the hivemind. string, this actually forces all modules at the root scope (e.g. utils.py and huggingface_auth.py at tanmoyai/sahajbert) to use the same logger. Because of the previous bug, we end up logging the same message multiple times even if we use different names.
It is not obvious how to force other libraries (such as transformers) to use our logging style, so we have inconsistent log line styles in the example. Screenshot:
Since hivemind is a library, a developer may want to use it but keep the existing logging style in their application (i.e. force hivemind to follow it). Currently, there is no way to do it since get_logger() does not use message propagation to the application loggers.

Solution

First, we fix the bugs above, making get_logger() idempotent and avoiding trimming the actual logger name.
Next, we note that there are 3 possible use cases:
- (default) A user tries importing hivemind. It uses its own logging style among the package.
- A user likes the hivemind logging style and wants to use it among their application (e.g. force other libraries to follow it).
- A user does not like the hivemind logging style and wants to force it to follow another existing style.
We give a user a straightforward way to switch between these use cases via a special function:
```
use_hivemind_log_handler("in_hivemind")     # Option 1 (default)
use_hivemind_log_handler("in_root_logger")  # Option 2, make the root logger to use hivemind style
use_hivemind_log_handler("nowhere")         # Option 3, propagate hivemind logs to the existing root logger
```
Note: This approach is inspired by the transformers.logging module (docs, source). The module allows to enable/disable the propagation to the root logger and enable/disable the transformers default log style. However, our API is even higher-level.
We enable the in_root_logger mode in examples/albert, so that all messages (from __main__, transformers, and hivemind itself) consistently follow the hivemind style. Screenshot:
We change some log messages to improve their presentation.

codecov · 2021-09-04T03:40:10Z

Codecov Report

Merging #378 (3c6c551) into master (fb3f57b) will decrease coverage by 0.17%.
The diff coverage is 55.76%.

@@            Coverage Diff             @@
##           master     #378      +/-   ##
==========================================
- Coverage   84.23%   84.05%   -0.18%     
==========================================
  Files          70       70              
  Lines        6348     6383      +35     
==========================================
+ Hits         5347     5365      +18     
- Misses       1001     1018      +17

Impacted Files	Coverage Δ
hivemind/optim/collaborative.py	`25.49% <14.28%> (ø)`
hivemind/utils/logging.py	`72.82% <62.22%> (-13.14%)`	⬇️

borzunov · 2021-09-04T04:38:02Z

examples/albert/run_trainer.py

-    logger.warning(
-        f"Process rank: {training_args.local_rank}, device: {training_args.device}, n_gpu: {training_args.n_gpu}"
-        + f"distributed training: {bool(training_args.local_rank != -1)}, 16-bits training: {training_args.fp16}"
-    )


This message is removed since training_args are already logged.

hivemind/utils/logging.py

mryab · 2021-09-05T13:11:01Z

hivemind/utils/logging.py

+    elif _current_mode == StyleMode.EVERYWHERE:
+        _disable_default_handler(None)


I'm not sure if this is going to be a frequent enough use case, aside from our examples. Hivemind is first and foremost a library, and from my understanding, no other library implements this format hijacking mechanism: thus, I am not sure that this feature is actually required and expected to be present in a library about distributed DL. Probably, the reason for this is that it's usually expected that logs from the given library are consistent in their format in all applications for ease of support and issue reporting, and thus one has no direct incentive to implicitly adopt the logging format of other library

This isn't a blocker, I'm just afraid that adding unnecessary/rarely used features should not be our focus regarding logging

hivemind/utils/logging.py

borzunov · 2021-09-05T20:13:04Z

examples/albert/run_trainer.py

-        format="%(asctime)s - %(levelname)s - %(name)s -   %(message)s",
-        datefmt="%m/%d/%Y %H:%M:%S",
-        level=logging.INFO if is_main_process(training_args.local_rank) else logging.WARN,
-    )


This basicConfig() setting had no effect:

> This function does nothing if the root logger already has handlers configured, unless the keyword argument force is set to True.

borzunov · 2021-09-06T22:26:56Z

@justheuristic has verbally approved this.

borzunov requested review from justheuristic and mryab September 4, 2021 03:38

borzunov mentioned this pull request Sep 4, 2021

[hotfix] Fix logging message line multiple times tanmoyio/sahajBERT#3

Merged

borzunov commented Sep 4, 2021

View reviewed changes

mryab requested changes Sep 5, 2021

View reviewed changes

mryab reviewed Sep 5, 2021

View reviewed changes

hivemind/utils/logging.py Outdated Show resolved Hide resolved

borzunov commented Sep 5, 2021

View reviewed changes

borzunov added 9 commits September 6, 2021 21:29

Fix get_logger() reuse and module name trimming

402379f

Fix logging in run_trainer.py

b3cde88

Make transformers logging use same style

6b2e46e

Implement logging modes

ca9c618

Make interface more friendly

f559bc1

Blackify

f50bbbb

Fix @mryab's comments

61945b4

Improve names, add docstrings

1c490bd

Move HandlerMode definition

d051bc9

borzunov force-pushed the fix-get-logger branch from 69b8acb to d051bc9 Compare September 6, 2021 18:29

borzunov added 6 commits September 6, 2021 21:32

Remove special case stripping "hivemind."

3e186dd

Rename fetch_collaboration_state() -> _fetch_state()

825a45b

Move global variable definitions, remove PACKAGE_NAME

d57a75d

Log prefix in CollaborativeOptimizer

63e0e2d

Move enum definition

6bf9073

Fix @mryab's comments

cbe402e

mryab approved these changes Sep 6, 2021

View reviewed changes

borzunov added 2 commits September 7, 2021 01:08

Add "towards step N" to the log message

ee1733b

Add "#" before step numbers in log messages

3c6c551

borzunov changed the title ~~Rework hivemind.utils.logging~~ Improve hivemind.utils.logging and log messages Sep 6, 2021

borzunov changed the title ~~Improve hivemind.utils.logging and log messages~~ Improve hivemind.utils.logging and some log messages Sep 6, 2021

mryab changed the title ~~Improve hivemind.utils.logging and some log messages~~ Make log handlers configurable, shorten entries Sep 6, 2021

borzunov merged commit b84f62b into master Sep 6, 2021

borzunov deleted the fix-get-logger branch September 6, 2021 23:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make log handlers configurable, shorten entries #378

Make log handlers configurable, shorten entries #378

borzunov commented Sep 4, 2021 •

edited

Loading

codecov bot commented Sep 4, 2021 •

edited

Loading

borzunov Sep 4, 2021

mryab Sep 5, 2021

mryab Sep 5, 2021

borzunov Sep 5, 2021

borzunov commented Sep 6, 2021

		elif _current_mode == StyleMode.EVERYWHERE:
		_disable_default_handler(None)

Make log handlers configurable, shorten entries #378

Make log handlers configurable, shorten entries #378

Conversation

borzunov commented Sep 4, 2021 • edited Loading

Preliminaries

Solution

codecov bot commented Sep 4, 2021 • edited Loading

Codecov Report

borzunov Sep 4, 2021

Choose a reason for hiding this comment

mryab Sep 5, 2021

Choose a reason for hiding this comment

mryab Sep 5, 2021

Choose a reason for hiding this comment

borzunov Sep 5, 2021

Choose a reason for hiding this comment

borzunov commented Sep 6, 2021

borzunov commented Sep 4, 2021 •

edited

Loading

codecov bot commented Sep 4, 2021 •

edited

Loading