onnxruntime/tools/doc/rename_folders.py
Justin Chu d834ec895a
Adopt lintrunner as the linting tool - take 2 (#15085)
### Description

`lintrunner` is a linter runner successfully used by pytorch, onnx and
onnx-script. It provides a uniform experience running linters locally
and in CI. It supports all major dev systems: Windows, Linux and macOS.
The checks are enforced by the `Python format` workflow.

This PR adopts `lintrunner` in onnxruntime and fixes ~2000 flake8 errors
in Python code. `lintrunner` now runs all required Python lints,
including `ruff` (replacing `flake8`), `black` and `isort`. Future lints
like `clang-format` can be added.

Most errors are auto-fixed by `ruff` and the fixes should be considered
robust.

Lints that are more complicated to fix are suppressed with `# noqa` for
now and should be fixed in follow-up PRs.

### Notable changes

1. This PR **removed some suboptimal patterns**:

	- `not xxx in` -> `xxx not in` membership checks
	- bare excepts (`except:` -> `except Exception`)
	- unused imports
	
	A follow-up PR will remove:
	
	- `import *`
	- mutable values as default in function definitions (`def func(a=[])`)
	- more unused imports
	- unused local variables

2. Use `ruff` to replace `flake8`. `ruff` is much (40x) faster than
flake8 and is more robust. We are using it successfully in onnx and
onnx-script. It also supports auto-fixing many flake8 errors.

3. Removed the legacy flake8 CI flow and updated docs.

4. The added workflow supports SARIF code scanning reports on GitHub.
Example snapshot:
	

![image](https://user-images.githubusercontent.com/11205048/212598953-d60ce8a9-f242-4fa8-8674-8696b704604a.png)

5. Removed `onnxruntime-python-checks-ci-pipeline` as redundant
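
The patterns named in item 1 can be illustrated with a minimal sketch. The function names below are hypothetical and do not come from the onnxruntime code base; each `*_before` function shows the flagged pattern and each `*_after` function shows the replacement. The mutable-default case is one of the patterns deferred to the follow-up PR.

```python
# Illustrative only: these helpers are hypothetical, not onnxruntime code.


def is_missing_before(mapping, key):
    # Flagged: `not key in mapping` works but reads ambiguously.
    return not key in mapping  # noqa: E713


def is_missing_after(mapping, key):
    # Preferred: the dedicated `not in` operator.
    return key not in mapping


def parse_int_before(text):
    try:
        return int(text)
    except:  # noqa: E722  (bare except also catches KeyboardInterrupt/SystemExit)
        return None


def parse_int_after(text):
    try:
        return int(text)
    except Exception:
        # Still broad, but lets KeyboardInterrupt and SystemExit propagate.
        return None


def append_before(item, bucket=[]):  # noqa: B006
    # The default list is created once and shared across calls.
    bucket.append(item)
    return bucket


def append_after(item, bucket=None):
    # A fresh list is created on every call that omits `bucket`.
    if bucket is None:
        bucket = []
    bucket.append(item)
    return bucket
```

The `# noqa` comments mirror how this PR suppresses lints that are not yet fixed; the rule codes shown are the usual pycodestyle and flake8-bugbear identifiers and are included here only as an assumption about which rules would fire.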

### Motivation and Context

A unified linting experience locally and in CI.

Replacing https://github.com/microsoft/onnxruntime/pull/14306

---------

Signed-off-by: Justin Chu <justinchu@microsoft.com>
2023-03-24 15:29:03 -07:00

84 lines
2.6 KiB
Python

"""
GitHub publishes the markdown documentation with Jekyll enabled.
This extension does not publish any folder starting with `_`.
These folders need to be renamed.
"""
import os
import re


def rename_folder(root):
    """
    Renames all folders starting with `_`.
    Returns the list of renamed folders.
    """
    found = []
    for r, dirs, _files in os.walk(root):
        for name in dirs:
            if name.startswith("_"):
                found.append((r, name))
    renamed = []
    for r, name in found:
        into = name.lstrip("_")
        renamed.append((r, name, into))
        full_src = os.path.join(r, name)
        full_into = os.path.join(r, into)
        if os.path.exists(full_into):
            # Interpolate the conflicting path; the original message left %r unfilled.
            raise RuntimeError("%r already exists, previous documentation should be removed." % full_into)
        print("rename %r" % full_src)
        os.rename(full_src, full_into)

    return renamed


def replace_files(root, renamed):
    subs = {r[1]: r[2] for r in renamed}
    reg = re.compile(
        '(\\"[a-zA-Z0-9\\.\\/\\?\\:@\\-_=#]+\\.([a-zA-Z]){2,6}'
        '([a-zA-Z0-9\\.\\&\\/\\?\\:@\\-_=#])*\\")'
    )

    for r, _dirs, files in os.walk(root):
        for name in files:
            if os.path.splitext(name)[-1] != ".html":
                continue
            full = os.path.join(r, name)
            with open(full, encoding="utf-8") as f:
                content = f.read()
            find = reg.findall(content)
            repl = []
            for f in find:
                if f[0].startswith("http"):
                    continue
                for k, v in subs.items():
                    if k == v:
                        raise ValueError(f"{k!r} == {v!r}")
                    if ('"%s' % k) in f[0]:
                        repl.append((f[0], f[0].replace('"%s' % k, '"%s' % v)))
                    if ("/%s" % k) in f[0]:
                        repl.append((f[0], f[0].replace("/%s" % k, "/%s" % v)))
            if len(repl) == 0:
                continue
            print("update %r" % full)
            for k, v in repl:
                content = content.replace(k, v)
            with open(full, "w", encoding="utf-8") as f:
                f.write(content)


if __name__ == "__main__":
    import sys

    if len(sys.argv) > 1:
        root = sys.argv[-1]
    else:
        root = "../../build/docs/html"
    print("look into %r" % root)
    ren = rename_folder(root)
    if len(ren) == 0:
        ren = [
            ("", "_static", "static"),
            ("", "_images", "images"),
            ("", "_downloads", "downloads"),
            ("", "_sources", "sources"),
            ("", "_modules", "modules"),
        ]
    replace_files(root, ren)
    print("done.")