feat: use GitHub's default README as index page #255

dandhlee · 2022-10-05T05:20:26Z

Instead of using a custom index page that provided little to no benefit, this PR is incorporating the existing README as the main index page. This was asked by a lot of DPEs for their client libraries' landing page.

The markdown generator unfortunately messed around with the spacing for image links (see tests/markdown_example_bad_image_links.md file for reference), but all other parts I've verified is looking good. (See staged version of the doc for https://cloud.google.com/python/docs/reference/storage/latest). Adding a small parser to help fix that for the index file.

Fixes #105.
Fixes #107.

Tests pass

tbpg · 2022-10-05T12:56:46Z

docfx_yaml/extension.py

+    """Cleans extra whitespace that breaks image links in index.html file."""
+    image_link_pattern='\[\s*!\[image\]\(.*\)\s*\]\(.*\)'
+    new_lines = []
+    with open(mdfile_path) as mdfile_iterator:


Just name this mdfile? No need for the type suffix.

tbpg · 2022-10-05T12:58:22Z

docfx_yaml/extension.py

+        broken_image_links = [
+            [m.start(), m.end()]
+            for m in re.finditer(image_link_pattern, file_content)
+        ]


This seems like it could be done in a single loop, rather than creating this list then iterating over it below?

You're right. Removed the redundant list.

tbpg · 2022-10-05T12:58:55Z

docfx_yaml/extension.py

+
+        new_lines.append(file_content[prev_end:])
+
+    with open(mdfile_path, 'w') as mdfile_iterator:


Same naming comment (especially confusing because this writes to something named iterator, which doesn't really make sense to me).

tbpg · 2022-10-05T12:59:43Z

docfx_yaml/extension.py

@@ -1448,16 +1477,16 @@ def prepend_markdown_header(filename: str, mdfile: Iterable[str]):
 def find_markdown_pages(app, outdir):
    # Use this to ignore markdown files that are unnecessary.
    files_to_ignore = [
-        "index.md",     # merge index.md and README.md and index.yaml later.
-                        # See https://github.com/googleapis/sphinx-docfx-yaml/issues/105.
+        "index.md",     # use readme.md instead

        "reference.md", # Reference docs overlap with Overview. Will try and incorporate this in later.


Do we still want to ignore this?

Yes - the current setup after running Sphinx build creates index.md and readme.md, the default index.md contains a bit more info than readme.md file that we want to use. So, we'll continue to ignore it, overwrite it with readme.md file instead.

tbpg · 2022-10-05T13:01:42Z

docfx_yaml/extension.py

+
+            shutil.copy(mdfile, f"{outdir}/{mdfile_name_to_use}")
+
+            # Do not add index file to the TOC.


Why not? Seems like we could set name to Overview here and not need the special handling down below ([{'name': 'Overview', 'href': 'index.md'}] + app.env.markdown_pages + pkg_toc_yaml)?

Perhaps for sorting? Still seems like it would be better to handle all md pages in this function.

Putting the handler here did make the order tweak a bit (Overview doesn't always appear at the top). But I agree that the logic should be streamlined. Modified the handler here instead.

tbpg · 2022-10-05T13:02:14Z

docfx_yaml/extension.py

+            # Do not add index file to the TOC.
+            if mdfile_name_to_use == 'index.md':
+                # Clean up any broken image links for the index file.
+                clean_image_links(f"{outdir}/{mdfile_name_to_use}")


This doesn't apply for every md file?

I've only seen this for the README pages containing the image, but wouldn't hurt to look for in every page. Updated.

tbpg · 2022-10-05T13:06:09Z

tests/markdown_example_bad_image_links.md

+
+[
+
+![image](https://img.shields.io/badge/support-stable-gold.svg)


Seems like GitHub doesn't render these properly?

Currently it doesn't because they're broken up - when the whitespaces are all trimmed then it should look fine, see https://github.com/googleapis/sphinx-docfx-yaml/blob/d016f0422bfb829df4eeb9fe8a9d10a5617f0e69/tests/markdown_example_bad_image_links_want.md

dandhlee

Thank you! Please take a look again.

dandhlee · 2022-10-05T13:13:55Z

docfx_yaml/extension.py

+    """Cleans extra whitespace that breaks image links in index.html file."""
+    image_link_pattern='\[\s*!\[image\]\(.*\)\s*\]\(.*\)'
+    new_lines = []
+    with open(mdfile_path) as mdfile_iterator:


dandhlee · 2022-10-05T13:15:43Z

docfx_yaml/extension.py

+        broken_image_links = [
+            [m.start(), m.end()]
+            for m in re.finditer(image_link_pattern, file_content)
+        ]


You're right. Removed the redundant list.

dandhlee · 2022-10-05T13:15:56Z

docfx_yaml/extension.py

+
+        new_lines.append(file_content[prev_end:])
+
+    with open(mdfile_path, 'w') as mdfile_iterator:


dandhlee · 2022-10-05T13:19:20Z

docfx_yaml/extension.py

@@ -1448,16 +1477,16 @@ def prepend_markdown_header(filename: str, mdfile: Iterable[str]):
 def find_markdown_pages(app, outdir):
    # Use this to ignore markdown files that are unnecessary.
    files_to_ignore = [
-        "index.md",     # merge index.md and README.md and index.yaml later.
-                        # See https://github.com/googleapis/sphinx-docfx-yaml/issues/105.
+        "index.md",     # use readme.md instead

        "reference.md", # Reference docs overlap with Overview. Will try and incorporate this in later.


Yes - the current setup after running Sphinx build creates index.md and readme.md, the default index.md contains a bit more info than readme.md file that we want to use. So, we'll continue to ignore it, overwrite it with readme.md file instead.

dandhlee · 2022-10-05T13:22:53Z

tests/markdown_example_bad_image_links.md

+
+[
+
+![image](https://img.shields.io/badge/support-stable-gold.svg)


Currently it doesn't because they're broken up - when the whitespaces are all trimmed then it should look fine, see https://github.com/googleapis/sphinx-docfx-yaml/blob/d016f0422bfb829df4eeb9fe8a9d10a5617f0e69/tests/markdown_example_bad_image_links_want.md

dandhlee · 2022-10-05T13:23:30Z

docfx_yaml/extension.py

+            # Do not add index file to the TOC.
+            if mdfile_name_to_use == 'index.md':
+                # Clean up any broken image links for the index file.
+                clean_image_links(f"{outdir}/{mdfile_name_to_use}")


I've only seen this for the README pages containing the image, but wouldn't hurt to look for in every page. Updated.

dandhlee · 2022-10-05T13:25:56Z

docfx_yaml/extension.py

+
+            shutil.copy(mdfile, f"{outdir}/{mdfile_name_to_use}")
+
+            # Do not add index file to the TOC.


Putting the handler here did make the order tweak a bit (Overview doesn't always appear at the top). But I agree that the logic should be streamlined. Modified the handler here instead.

feat: use GitHub's default README as index page

d016f04

dandhlee requested review from a team as code owners October 5, 2022 05:20

product-auto-label bot added the size: m Pull request size is medium. label Oct 5, 2022

tbpg reviewed Oct 5, 2022

View reviewed changes

feat: address review comemnts

4fb9f36

dandhlee commented Oct 5, 2022

View reviewed changes

tbpg approved these changes Oct 5, 2022

View reviewed changes

dandhlee merged commit 17f6ca0 into main Oct 5, 2022

dandhlee deleted the merge_readme branch October 5, 2022 17:25

release-please bot mentioned this pull request Oct 5, 2022

chore(main): release 1.6.0 #256

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: use GitHub's default README as index page #255

feat: use GitHub's default README as index page #255

dandhlee commented Oct 5, 2022

tbpg Oct 5, 2022

dandhlee Oct 5, 2022

tbpg Oct 5, 2022

dandhlee Oct 5, 2022

tbpg Oct 5, 2022

dandhlee Oct 5, 2022

tbpg Oct 5, 2022

dandhlee Oct 5, 2022

tbpg Oct 5, 2022

dandhlee Oct 5, 2022

tbpg Oct 5, 2022

dandhlee Oct 5, 2022

tbpg Oct 5, 2022

dandhlee Oct 5, 2022

dandhlee left a comment

dandhlee Oct 5, 2022

dandhlee Oct 5, 2022

dandhlee Oct 5, 2022

dandhlee Oct 5, 2022

dandhlee Oct 5, 2022

dandhlee Oct 5, 2022

dandhlee Oct 5, 2022


		new_lines.append(file_content[prev_end:])

		with open(mdfile_path, 'w') as mdfile_iterator:


		shutil.copy(mdfile, f"{outdir}/{mdfile_name_to_use}")

		# Do not add index file to the TOC.


		[

		![image](https://img.shields.io/badge/support-stable-gold.svg)

feat: use GitHub's default README as index page #255

feat: use GitHub's default README as index page #255

Conversation

dandhlee commented Oct 5, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dandhlee left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment