alibaba
diff --git a/‎.gitignore
+158 b/‎.gitignore
+158
diff --git a/‎.isort.cfg
+2 b/‎.isort.cfg
+2
diff --git a/‎README.md
+10-11 b/‎README.md
+10-11
diff --git a/‎docs/Makefile
+19 b/‎docs/Makefile
+19
diff --git a/‎docs/README.md
+98 b/‎docs/README.md
+98
diff --git a/‎docs/build_docs.sh
+11 b/‎docs/build_docs.sh
+11
@@ -0,0 +1,158 @@
+*.tar.gz
+*.zip
+*.gz
+*.txt
+*.tsv
+*.csv
+.idea
+easytexminer.egg-info
+build
+tools
+xflow_deploy
+dist
+.DS_Store
+__pycache__
+### Python template
+# Byte-compiled / optimized / DLL files
+__pycache__/
+*.py[cod]
+*$py.class
+
+# C extensions
+*.so
+
+# Distribution / packaging
+.Python
+build/
+develop-eggs/
+dist/
+downloads/
+eggs/
+.eggs/
+lib/
+lib64/
+parts/
+sdist/
+var/
+wheels/
+share/python-wheels/
+*.egg-info/
+.installed.cfg
+*.egg
+MANIFEST
+
+# PyInstaller
+#  Usually these files are written by a python script from a template
+#  before PyInstaller builds the exe, so as to inject date/other infos into it.
+*.manifest
+*.spec
+
+# Installer logs
+pip-log.txt
+pip-delete-this-directory.txt
+
+# Unit test / coverage reports
+htmlcov/
+.tox/
+.nox/
+.coverage
+.coverage.*
+.cache
+nosetests.xml
+coverage.xml
+*.cover
+*.py,cover
+.hypothesis/
+.pytest_cache/
+cover/
+
+# Translations
+*.mo
+*.pot
+
+# Django stuff:
+*.log
+local_settings.py
+db.sqlite3
+db.sqlite3-journal
+
+# Flask stuff:
+instance/
+.webassets-cache
+
+# Scrapy stuff:
+.scrapy
+
+# Sphinx documentation
+docs/_build/
+
+# PyBuilder
+.pybuilder/
+target/
+
+# Jupyter Notebook
+.ipynb_checkpoints
+
+# IPython
+profile_default/
+ipython_config.py
+
+# pyenv
+#   For a library or package, you might want to ignore these files since the code is
+#   intended to run in multiple environments; otherwise, check them in:
+# .python-version
+
+# pipenv
+#   According to pypa/pipenv#598, it is recommended to include Pipfile.lock in version control.
+#   However, in case of collaboration, if having platform-specific dependencies or dependencies
+#   having no cross-platform support, pipenv may install dependencies that don't work, or not
+#   install all needed dependencies.
+#Pipfile.lock
+
+# PEP 582; used by e.g. github.com/David-OConnor/pyflow
+__pypackages__/
+
+# Celery stuff
+celerybeat-schedule
+celerybeat.pid
+
+# SageMath parsed files
+*.sage.py
+
+# Environments
+.env
+.venv
+env/
+venv/
+ENV/
+env.bak/
+venv.bak/
+
+# Spyder project settings
+.spyderproject
+.spyproject
+
+# Rope project settings
+.ropeproject
+
+# mkdocs documentation
+/site
+
+# mypy
+.mypy_cache/
+.dmypy.json
+dmypy.json
+user_datasets
+# Pyre type checker
+.pyre/
+
+# pytype static type analyzer
+.pytype/
+
+# Cython debug symbols
+cython_debug/
+
+glue_data/
+results/
+logs/
+.vscode/
@@ -0,0 +1,2 @@
+[settings]
+known_third_party = numpy,requests,rouge,scipy,setuptools,sklearn,sphinx_rtd_theme,torch,tqdm
@@ -2,7 +2,7 @@
 
 <p align="center">
     <br>
-    <img src="https://cdn.nlark.com/yuque/0/2022/png/2480469/1649297935073-2fce0ec9-ec8c-490f-bc25-a8cf50d9918f.png" width="300"/>
+    <img src="https://cdn.nlark.com/yuque/0/2022/png/2480469/1649297935073-2fce0ec9-ec8c-490f-bc25-a8cf50d9918f.png" width="200"/>
     <br>
 <p>
 
@@ -24,13 +24,11 @@ EasyNLP is an easy-to-use NLP development and application toolkit in PyTorch, fi
 - **Compatible with open-source libraries:** EasyNLP has APIs to support the training of models from Huggingface/Transformers with the PAI distributed framework. It also supports the pre-trained models in EasyTransfer ModelZoo.
 - **Knowledge-injected pre-training:** The PAI team has a lot of research on knowledge-injected pre-training, and builds a knowledge-injected model that wins the first place in the CCF knowledge pre-training competition. EasyNLP integrates these cutting-edge knowledge pre-trained models, including DKPLM and KGBERT.
 - **Landing large pre-trained models:** EasyNLP provides few-shot learning capabilities, allowing users to finetune large models with only a few samples to achieve good results. At the same time, it provides knowledge distillation functions to help quickly distill large models to a small and efficient model to faciliate online deployment.
-- **Seamless integration to PAI products:** It is seamlessly integrated to [Platform of AI (PAI)](https://www.aliyun.com/product/bigdata/product/learn) products, including PAI-DSW for development, PAI-DLC for cloud-native training, PAI-EAS for serving, and PAI-Designer for zero-coding model training.
-
-
+- **Seamless integration to PAI products::** It is seamlessly integrated to [Platform of AI (PAI)](https://www.aliyun.com/product/bigdata/product/learn) products, including PAI-DSW for development, PAI-DLC for cloud-native training, PAI-EAS for serving, and PAI-Designer for zero-coding model training.
 
 # Installation
 
-You can either install from pip 
+You can either install from pip
 
 ```bash
 $ pip install pai-easynlp (to be released)
@@ -43,11 +41,12 @@ $ git clone https://github.com/alibaba/EasyNLP.git
 $ cd EasyNLP
 $ python setup.py install
 ```
-This repo is tested on Python3.6, PyTorch >= 1.8.
 
+This repo is tested on Python3.6, PyTorch >= 1.8.
 
 # Quick Start
-Now let's show how to use just a few lines of code to build a text classification model based on BERT. 
+
+Now let's show how to use just a few lines of code to build a text classification model based on BERT.
 
 ```python
 from easynlp.core import Trainer
@@ -71,6 +70,7 @@ Trainer(model=model,  train_dataset=train_dataset).train()
 ```
 
 Then you can run the code:
+
 ```bash
 python main.py \
   --mode train \
@@ -85,7 +85,7 @@ python main.py \
   --user_defined_parameters='pretrain_model_name_or_path=bert-tiny-uncased'
 ```
 
-You can also use AppZoo Command Line Tools to quickly train an App model. Take text classification on SST-2 dataset as an example. First you can download the [train.tsv](http://atp-modelzoo-sh.oss-cn-shanghai.aliyuncs.com/release/tutorials/classification/train.tsv), and [dev.tsv](http://atp-modelzoo-sh.oss-cn-shanghai.aliyuncs.com/release/tutorials/classification/dev.tsv), then start training: 
+You can also use AppZoo Command Line Tools to quickly train an App model. Take text classification on SST-2 dataset as an example. First you can download the [train.tsv](http://atp-modelzoo-sh.oss-cn-shanghai.aliyuncs.com/release/tutorials/classification/train.tsv), and [dev.tsv](http://atp-modelzoo-sh.oss-cn-shanghai.aliyuncs.com/release/tutorials/classification/dev.tsv), then start training:
 
 ```bash
 $ easynlp \
@@ -117,9 +117,8 @@ $ easynlp \
   --checkpoint_path=./classification_model \
   --app_name=text_classify
 ```
-To learn more about the usage of AppZoo, please refer to our [documentation](https://www.yuque.com/easyx/easynlp/psm6fr).
-
 
+To learn more about the usage of AppZoo, please refer to our [documentation](https://www.yuque.com/easyx/easynlp/psm6fr).
 
 # Tutorials
 
@@ -133,8 +132,8 @@ To learn more about the usage of AppZoo, please refer to our [documentation](htt
 - [小样本学习实践](https://www.yuque.com/easyx/easynlp/vgbopy)
 - API docs: [http://atp-modelzoo-sh.oss-cn-shanghai.aliyuncs.com/release/easynlp/easynlp_docs/html/index.html](http://atp-modelzoo-sh.oss-cn-shanghai.aliyuncs.com/release/easynlp/easynlp_docs/html/index.html)
 
-
 # Contact Us
+
 Scan the following QR codes to join Dingtalk discussion group. The group discussions are most in Chinese, but English is also welcomed.
 
 <img src="https://cdn.nlark.com/yuque/0/2020/png/2480469/1600310258842-d7121051-32f1-494b-a7a5-a35ede74b6c4.png#align=left&display=inline&height=352&margin=%5Bobject%20Object%5D&name=image.png&originHeight=1178&originWidth=1016&size=312154&status=done&style=none&width=304" width="300"/>
 
@@ -0,0 +1,19 @@
+# Minimal makefile for Sphinx documentation
+#
+
+# You can set these variables from the command line.
+SPHINXOPTS    =
+SPHINXBUILD   = sphinx-build
+SOURCEDIR     = source
+BUILDDIR      = build
+
+# Put it first so that "make" without argument is like "make help".
+help:
+	@$(SPHINXBUILD) -M help "$(SOURCEDIR)" "$(BUILDDIR)" $(SPHINXOPTS) $(O)
+
+.PHONY: help Makefile
+
+# Catch-all target: route all unknown targets to Sphinx using the new
+# "make mode" option.  $(O) is meant as a shortcut for $(SPHINXOPTS).
+%: Makefile
+	@$(SPHINXBUILD) -M $@ "$(SOURCEDIR)" "$(BUILDDIR)" $(SPHINXOPTS) $(O)
@@ -0,0 +1,98 @@
+### How to maintain docs
+
+### 1. Install easy transfer
+
+```
+$ cd /to/dir/EasyNLP
+$ python setup.py install
+```
+
+### 2. Install sphinx
+
+```bash
+$ pip install sphinx
+$ pip install sphinx_rtd_theme
+```
+
+### 3. Add modules
+
+#### 3.1  Add class or functions in existing files
+
+You need to add class or functions with `docstring` into the attached file.
+
+1. Google Python Style Guide [link](http://google.github.io/styleguide/pyguide.html#381-docstrings)
+1. Google docstring Sample [link](https://sphinxcontrib-napoleon.readthedocs.io/en/latest/example_google.html)
+1. Sample project：torch.nn.modules.conv [link](https://pytorch.org/docs/stable/_modules/torch/nn/modules/conv.html#Conv1d)
+1. Take `easynlp.appzoo.classification.BertTextClassify` as example：
+
+````python
+class BertTextClassify(BertPreTrainedModel):
+    """
+        Transformer model from ```Attention Is All You Need''',
+        Original paper: https://arxiv.org/abs/1706.03762
+
+        Args:
+            num_token (int): vocab size.
+            num_layer (int): num of layer.
+            num_head (int): num of attention heads.
+            embedding_dim (int): embedding dimension.
+            attention_head_dim (int): attention head dimension.
+            feed_forward_dim (int): feed forward dimension.
+            initializer: initializer type.
+            activation: activation function.
+            dropout (float): dropout rate (0.0 to 1.0).
+            attention_dropout (float): dropout rate for attention layer.
+
+        Returns: None
+    """
+````
+
+#### 3.2  Add new file
+
+For example, if you need to add a new file in `easynlp/data` and the file name is `blackmagic.py` with a `BlackMagic` class:
+
+1. Add `docstring` to the code
+1. In `docs/source/api/data.rst`，Find a position for `blackmagic` and add
+
+```rst
+blackmagic
+--------------------------------------
+
+.. automodule:: easynlp.data.blackmagic
+    :members:
+    :undoc-members:
+    :show-inheritance:
+
+```
+
+#### 3.3  Add new directory
+
+For example, you want to add a `magic` directory in `ez_transfer`，and there is a file named `blackmagic.py` with a `BlackMagic` class:
+
+1. Add `docstring` to the code
+1. Add file `docs/source/api/magic.rst` and write the following line
+
+```rst
+ez\_transfer.magic
+===========================
+```
+
+3. In `docs/source/api/magic.rst`，Find a position for `blackmagic` and add
+
+```rst
+blackmagic
+--------------------------------------
+
+.. automodule:: ez_transfer.layers.blackmagic
+    :members:
+    :undoc-members:
+    :show-inheritance:
+
+```
+
+### 4.  Generate doc html
+
+```bash
+$ cd docs/
+$ sh build_docs.sh
+```
@@ -0,0 +1,11 @@
+# Test with sphinx
+pip install sphinx==1.8.6
+pip install sphinx_rtd_theme
+
+rm -rf build
+make html
+
+# upload to oss
+ossconfig=`cat /home/admin/workspace/odps_clt_release_64/conf/atp-public-eki`
+echo 'copy files to atp-modelzoo docs'
+ossutil64 cp -f build oss://atp-modelzoo-sh/release/easynlp/easynlp_docs/ $ossconfig --recursive
Original file line number	Diff line number	Diff line change
`@@ -0,0 +1,2 @@`
	`1`	`+[settings]`
	`2`	`+known_third_party = numpy,requests,rouge,scipy,setuptools,sklearn,sphinx_rtd_theme,torch,tqdm`