aae38db1a5
* add code owners * initial commit, beginnings of AML version of face synthetics search pipeline. * Add download_and_extract_zip Add download capability to FaceSyntheticsDataset Fix face segmentation data prep script. * fix bugs * cleanup launch.json * cleanup launch.json add download capability to FaceSyntheticsDataset add download_and_extract_zip helper * fix file count test * work in progress * work in progress * unify snpe status table and aml training table. * fix experiment referencing * fix experiment referencing * work in progress * fix complete status * fix bugs * fix bug * fix metric key, we have 2, one for remote snpe, and another for aml training pipelines. * pass seed through to the search.py script. * fix use of AzureMLOnBehalfOfCredential * fix bugs * fix bugs * publish new image * fix bugs * fix bugs * fix bug * maerge * revert * new version * fix bugs * rename the top level folder from 'snpe' to 'aml' and move all AML code into this folder except the top level entry point 'aml.py' make the keys returned from the JobCompletionMonitor wait method configurable Rename AmlPartialTrainingEvaluator and make it restartable. Turn off save_pareto_model_weights Remove redundant copy of JobCompletionMonitor * rev the versions. * updates to readme information. * only inference testing targets are 'cpu' and 'snp', trigger the aml partial training by a different key in the config file. * add iteration info * new version. * fix ordering of results from AmlPartialTrainingEvaluator * change AML batch size default to 64 for faster training don't store MODEL_STORAGE_CONNECTION_STRING * Fix bug in merge_status_entity, add more unit test coverage * new version * Store training time in status table. * improve diagram. * save iteration in status table. * pick up new version of archai to fix randomness bug in the EvolutionParetoSearch so that these search jobs are restartable. |
||
---|---|---|
.github | ||
.vscode | ||
archai | ||
confs | ||
docker | ||
docs | ||
research/lm_eval_harness | ||
scripts | ||
tasks | ||
tests | ||
.amltignore | ||
.gitattributes | ||
.gitignore | ||
AUTHORS.md | ||
CODEOWNERS | ||
CODE_OF_CONDUCT.md | ||
CONTRIBUTING.md | ||
LICENSE | ||
NOTICE.md | ||
README.md | ||
SECURITY.md | ||
pyproject.toml | ||
pytest.ini | ||
setup.cfg | ||
setup.py |
README.md
Installation
Archai can be installed through various methods, however, it is recommended to utilize a virtual environment such as conda
or pyenv
for optimal results.
To install Archai via PyPI, the following command can be executed:
pip install archai
Archai requires Python 3.8+ and PyTorch 1.7.0+ to function properly.
For further information, please consult the installation guide.
Quickstart
In this quickstart example, we will apply Archai in Natural Language Processing to find the optimal Pareto-frontier Transformers' configurations according to a set of objectives.
Creating the Search Space
We start by importing the TransformerFlexSearchSpace
class which represents the search space for the Transformer architecture:
from archai.discrete_search.search_spaces.nlp.transformer_flex.search_space import TransformerFlexSearchSpace
space = TransformerFlexSearchSpace("gpt2")
Defining Search Objectives
Next, we define the objectives we want to optimize. In this example, we use NonEmbeddingParamsProxy
, TransformerFlexOnnxLatency
, and TransformerFlexOnnxMemory
to define the objectives:
from archai.discrete_search.api.search_objectives import SearchObjectives
from archai.discrete_search.evaluators.nlp.parameters import NonEmbeddingParamsProxy
from archai.discrete_search.evaluators.nlp.transformer_flex_latency import TransformerFlexOnnxLatency
from archai.discrete_search.evaluators.nlp.transformer_flex_memory import TransformerFlexOnnxMemory
search_objectives = SearchObjectives()
search_objectives.add_objective(
"non_embedding_params",
NonEmbeddingParamsProxy(),
higher_is_better=True,
compute_intensive=False,
constraint=(1e6, 1e9),
)
search_objectives.add_objective(
"onnx_latency",
TransformerFlexOnnxLatency(space),
higher_is_better=False,
compute_intensive=False,
)
search_objectives.add_objective(
"onnx_memory",
TransformerFlexOnnxMemory(space),
higher_is_better=False,
compute_intensive=False,
)
Initializing the Algorithm
We use the EvolutionParetoSearch
algorithm to conduct the search:
from archai.discrete_search.algos.evolution_pareto import EvolutionParetoSearch
algo = EvolutionParetoSearch(
space,
search_objectives,
None,
"tmp",
num_iters=5,
init_num_models=10,
seed=1234,
)
Performing the Search
Finally, we call the search()
method to start the NAS process:
algo.search()
The algorithm will iterate through different network architectures, evaluate their performance based on the defined objectives, and ultimately produce a frontier of Pareto-optimal results.
Tasks
To demonstrate and showcase the capabilities/functionalities of Archai, a set of end-to-end tasks are provided:
Documentation
The official documentation also provides a series of notebooks.
Support
If you have any questions or feedback about the Archai project or the open problems in Neural Architecture Search, please feel free to contact us using the following information:
- Email: archai@microsoft.com
- Website: https://github.com/microsoft/archai/issues
We welcome any questions, feedback, or suggestions you may have and look forward to hearing from you.
Team
Archai has been created and maintained by Shital Shah, Debadeepta Dey, Gustavo de Rosa, Caio Mendes, Piero Kauffmann, Chris Lovett, Allie Del Giorno, Mojan Javaheripi, and Ofer Dekel at Microsoft Research.
Contributions
This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.microsoft.com.
When you submit a pull request, a CLA-bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., label, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repositories using our CLA.
This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.
Trademark
This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.
License
This project is released under the MIT License. Please review the file for more details.