WhatTheHack/066-OpenAIFundamentals/Student/Challenge-03.md

55 строки
2.9 KiB
Markdown
Исходник Постоянная ссылка Обычный вид История

[New Hack] 066-Open AI Fundamentals (#690) * Update Challenge-02.md * Update CH-01-PromptEngineering.ipynb delete variable * Update CH-01-PromptEngineering.ipynb delete variable * Update Challenge-03.md * Typo fix Challenge-01.md * Updated comments in 4th challenge * Update Challenge-01.md * Update Challenge-01.md * Update Challenge-02.md * Move Challenge markdowns * Edit Ch 5 * Edit CH01 * Edit CH01 * Edit Challenge Markdowns * Update Challenge-01.md add more resources for tactics * Update Challenge-01.md add AOAI models link * Delete CH-01-PromptEngineering.ipynb * Add files via upload * Delete CH-03-A-Grounding.ipynb * Delete CH-03-B-Chunking.ipynb * Delete CH-03-C-Embeddings.ipynb * changed challenge 3 files * Rename CH-03-A-Grounding-v2 (1).ipynb to CH-03-A-Grounding.ipynb * Rename CH-03-B-Chunking - v2 (1).ipynb to CH-03-B-Chunking.ipynb * Rename CH-03-C-Embeddings-v2 (1).ipynb to CH-03-C-Embeddings.ipynb * Fixed md formatting * Fixed md formatting * took out the depreciated model call * Fixed md formatting * Fixed md formatting * removed depreciated model * Adding CH3 Coach guides * Rename CH-03-A-Grounding (2).ipynb to CH-03-A-Grounding-Solution.ipynb * Rename CH-03-B-Chunking (2).ipynb to CH-03-B-Chunking-Solution.ipynb * Rename CH-03-C-Embeddings (2).ipynb to CH-03-C-Embeddings-Solution.ipynb * modified CH3-A coach solutions * modified CH3-B coach solutions * modified CH3-C coach solutions * Update Challenge-00.md Removing davinci-003, added VM SKU size. * modified path * added instructions about zip file * added instructions about jupyter notebook files * Update Challenge-01.md revised description * Update Challenge-02.md revised description * Update Challenge-01.md * Update Challenge-02.md * Delete CH-02-ModelComparison.ipynb * Add files via upload * Update CH-02-ModelComparison.ipynb * Update Challenge-01.md * Update Challenge-02.md * Update README.md * Update Challenge-03.md * Update Challenge-03.md * Update Challenge-04.md * Update Challenge-04.md * Update Challenge-05.md * tookout extra notes * tookout extra notes * added cell outputs * added cell outputs * added pip installs * added pip installs * added pip installs * Update README.md * Update README.md * Update Solution-01.md * deletion of venv and renaming folder * deletion of venv * Delete Solution-07.md * Delete Solution-08.md * Delete Solution-15.md * Delete Solution-14.md * Delete Solution-13.md * Delete Solution-12.md * Delete Solution-11.md * Delete Solution-10.md * Delete Solution-09.md * Delete Solution-06.md * Delete Lectures.pptx * Update Challenge-00.md * Update Challenge-00.md * Delete Solution-04.md * Delete Solution-05.md * Update Challenge-00.md * Update Challenge-00.md * Update Challenge-01.md * Update Challenge-01.md * Update Challenge-02.md * Update Challenge-02.md * Update Challenge-02.md * Update Challenge-03.md * Update Challenge-04.md * Update Challenge-05.md * Update Challenge-00.md * added whitelisted words * Edit Coach and Main Markdowns * wordsmith readme and CH0.md * Update README.md * fixed spelling mistakes * fixed spelling mistakes * added CH3 solution md * Update Solution-01.md update md * Update Solution-02.md update guide * Update Solution-01.md * Update Solution-02.md * Update Challenge-01.md * Update Challenge-01.md * Update Challenge-02.md * Update Challenge-02.md * Update Challenge-02.md * removed extra markdown files * Update Solution-03.md Replaced <Title of Challenge> with Grounding, Chunking, and Embedding * Update Challenge-00.md Fixed typo * Update Challenge-01.md Fixed typo * Update Challenge-04.md Fixed typo * Update README.md FIxed some typos * changed ! to Note: in Challenge-01.md * added title and some additional guidance on python in Solution-00.md * added sample env file * removed suggested agenda section * added conda version and took out depreciated model mention * added project env tip for vscode * added tips to get credentials from azure portal for aoai * took out depreciated model mentions * Creating challenge 2 as optional * Creating challenge 2 as optional * Update Challenge 5 * Update CH-01-PromptEngineering-Solution.ipynb * Add files via upload * fix solution add variable * Fixed typos Fixed typos. Also changed "challenges" within a challenge to "tasks" to make it clearer * Update README.md Added WTH to the title as per standard. Wordsmithed intro paragraph to indicate this hack is designed to be self-administered vs. regular WTH format. Fixed double spacing in Learning Objectives. * Update README.md Fixed broken ToC links (used to go to /Challenges subfolder) * Update Challenge-00.md Added spaces to the hack title in the intro section. Fixed the broken links in the Common Prereq section * Update Challenge-01.md Added "back" link to C0. Added descriptive text to two hyperlinks for models. Bolded the "note" * Update Challenge-05.md Removed phantom link to C6 * Update Challenge 00-02 for clarity * Clarify .env usage in CH-01-PromptEngineering * Update .wordlist.txt * Update Challenge-00.md Updated Challenge 00 with clearer organization and temporary link to the Resources.zip file * Update Challenge-00.md Added clarification to C0 regarding set up of Jupyter Notebooks. * Update Challenge-00.md fixed typos * removed repeat instructions for requirements.txt * WTH Feedback Changes * Update Challenge-02.md * Update Challenge-02.md * Update sample-env.txt * Update Solution-02.md * update to add clarifications * Update Challenge-01.md only one model is needed, add clarification * Update Solution-01.md add instructions * Update CH-02-ModelComparison.ipynb * changed version to be set to the env variable * added ch4 feedback from participants * changed format of success criteria * changed spelling issue * Edit CH 05 and Coach Guide * changes coach md files * Update CH-01-PromptEngineering.ipynb Fixed grammar issues * Update Challenge-02.md Cleaned up location of Resource.zip file (it's covered in Challenge 00) * Update Challenge-01.md code blocked folder name * Update Challenge-03.md code blocked folder location * Update Challenge-04.md folder location in code block * Update Challenge-04.md Grammar fix in Intro paragraph * Update Challenge-05.md folder to code block * Update Solution-01.md * Update Solution-02.md Fixed title to match Challenge 2 title. * Update Solution-03.md * Update Solution-04.md Fixed incorrect challenge title & navigation links * Update Solution-04.md * Update README.md moved Suggested Hack Agenda back to its proper location in the template. * Update README.md Remove "Additional Coach Prereqs" as there are none. Cleaned up "Repo Contents" section which was referring to older folders that don't exist. * Update README.md wordsmithed Suggested Hack Agenda * Update README.md * Update README.md wordsmithed the intro section and commented out the link to lecture deck until one is posted. * Update Solution-01.md added links to the solution notebooks in the coach guide * Update Solution-01.md * Update Solution-01.md * Update Solution-02.md added links to solution guide * Update Solution-03.md added link to solution notebooks * Update Solution-04.md * Delete 066-OpenAIFundamentals/Coach/Solutions/.gitkeep removed .gitkeep folder placeholder file * Update Challenge-00.md test task list toc at start * Update Challenge-00.md * Update Challenge-00.md fixed toc anchor links * Update Challenge-00.md Edited link to Free Azure Trial * Update README.md * Update Challenge-02.md playing with formatting * Update Challenge-02.md * Update Challenge-03.md formatting * Update Challenge-04.md updated reference from "3" jupyter notebooks to 2. * Update Challenge-05.md fixed reference to single notebook * Update Challenge-05.md wordsmithed success criteria * Update README.md commented out ACS cost * Update README.md * Update CH-02-ModelComparison.ipynb Changed "Student Challenge" to "Student Task" - 4 instances * Update CH-03-A-Grounding.ipynb grammar * Update CH-03-A-Grounding.ipynb * Update CH-03-A-Grounding.ipynb made SC a top level section * Update CH-03-B-Chunking.ipynb byte pair encoding (BPE) -> full title first, followed by abbreviation. Then you can use the abbreviation for subsequent mentions. * Update CH-03-B-Chunking.ipynb removed gratuitous "so" * Update CH-03-B-Chunking.ipynb clarify where PDF file is located * Update CH-03-B-Chunking.ipynb Removed task text from section headers, promoted Success Criteria to section header ## * Update CH-03-C-Embeddings.ipynb put embedding types in bullet list. promoted Success Criteria to section header. * Update CH-03-C-Embeddings.ipynb grammar on embedding types * Update CH-03-C-Embeddings.ipynb • Student Task #1 text out of section header. • Make 3.1 Generate Embeddings for a CSV file a section header • Make Student Task #2 a section header * Update CH-03-C-Embeddings.ipynb o Clarify location of CSV file -> /Resources/data/Automobile.csv * Update CH-04-A-RAG_for_structured_data.ipynb clean up Use case section * Update CH-04-A-RAG_for_structured_data.ipynb * Update CH-04-A-RAG_for_structured_data.ipynb grammar * Update CH-04-A-RAG_for_structured_data.ipynb * Update CH-04-A-RAG_for_structured_data.ipynb fixed section header for Document Zone task * Update CH-04-A-RAG_for_structured_data.ipynb * Update CH-05-ResponsibleAI.ipynb text anchor link fix * Update CH-05-ResponsibleAI.ipynb * Update CH-05-ResponsibleAI.ipynb bolded Note on a new line * Update CH-05-ResponsibleAI.ipynb * Update CH-05-ResponsibleAI.ipynb Descriptive text for resource links * Update CH-02-ModelComparison.ipynb Fixed text for how to request quota increases. * Update CH-02-ModelComparison.ipynb added note in bold * Update CH-02-ModelComparison.ipynb * Update CH-03-C-Embeddings.ipynb Fixed grammar on Text Search Embeddings * Update CH-03-C-Embeddings.ipynb Student Challenge #2 => Student Task #2 * Update CH-03-C-Embeddings.ipynb fixed locationi * Update CH-04-A-RAG_for_structured_data.ipynb Summarizer => Text summarization * Update CH-04-A-RAG_for_structured_data.ipynb added challenge number to the title * Update CH-04-A-RAG_for_structured_data.ipynb Added a note about possibly needing to update the path * Update CH-04-A-RAG_for_structured_data.ipynb remove comment * Update CH-04-B-RAG_for_unstructured_data.ipynb added title to challenge * Update CH-04-B-RAG_for_unstructured_data.ipynb * Added additional text clarifying paths to data files and what the student goals are for each notebook in C4 * Update CH-05-ResponsibleAI.ipynb shortened title to match student guide title * try to fix anchor links again * remove broken anchor links * student challenge to student task * Fixed table errors in Notebook C2 * Updated text describing ENV file in Ch1 notebook. Depreciated -> deprecated * updated ENV language --------- Co-authored-by: melody-N07 <70866315+melody-N07@users.noreply.github.com> Co-authored-by: Devanshi Thakar <33441411+devanshithakar12@users.noreply.github.com> Co-authored-by: Alexis Joseph <36856999+alexistj@users.noreply.github.com> Co-authored-by: ShivaKumarChittamuru <shchitt@microsoft.com> Co-authored-by: Amanda Wong <wongg.amanda@gmail.com> Co-authored-by: Amanda Wong <amawong@microsoft.com> Co-authored-by: Devanshi Thakar <thakardevanshi98@gmail.com> Co-authored-by: juanlldc <125307653+juanlldc@users.noreply.github.com> Co-authored-by: Pete Rodriguez <perktime@users.noreply.github.com> Co-authored-by: wongamanda <wongamanda@uchicago.edu> Co-authored-by: Peter C. Laudati <plaudati@hotmail.com> Co-authored-by: Israel Ekpo <37992436+izzymsft@users.noreply.github.com>
2023-11-07 06:03:09 +03:00
# Challenge 03 - Grounding, Chunking, and Embedding
[< Previous Challenge](./Challenge-02.md) - **[Home](../README.md)** - [Next Challenge >](./Challenge-04.md)
## Pre-requisites
* Azure Cognitive Search resource for indexing and retrieving relevant information
* Azure OpenAI service for Generative AI Models and Embedding Models
[Hack Update] 066-OpenAIFundamental-v2 (#798) * added date to readme * added codespace files * added CH1 ntbk codespace changes to original * added CH3A ntbk codespace changes to original * added CH3B ntbk codespace changes to original * added CH3C ntbk codespace changes to original * added CH4A ntbk codespace changes to original * added CH4B ntbk codespace changes to original * added CH3A solutions ntbk codespace changes to original * added CH3B solutions ntbk codespace changes to original * added CH3C solutions ntbk codespace changes to original * changed CH3A solutions * deleted CH3B Solutions CS version * deleted CH3C Solutions CS version and changed CH3C Solutions file * deleted CH3A,CH3B CS versions * deleted CH4A,CH4B CS files * deleted old CS files * Update Challenge-00-CS.md resolved spelling issue ".txt" * First draft of updates to C0 to include GH codespaces * minor update to C0 * minor update to C0. Removed redundant line about "being good to go" * cleaned up redundant instructions. Referred to .env file instead of sample_env.txt. <-- may need to change this back to follow best practices for including an .env file in a Git repo * minor C0 update * update & merged C1 * updated C2 * Updated C3 to cover codespace * Updated C4 & C5 to cover codespaces. Also corrected spelling in C1 & 2 * Updated C0 to refer to .env.sample file with instructions for student to rename it * merged/renamed .env-CS (from Dec fluency event) to .env.sample added a . to .gitignore Removed old env file * renamed Notebooks folder to lowercase notebooks * Added backticks to all references to .env file in C1-C4 * Removed all "CS" files from the December event fork * cleaned up references to /notebooks folder in local workstation instructions. Removed extra characters from hack home page README.md * updated the coach guide to refer to the availability of GitHub Codespace. * added link to Codespace repo to challenge 00! * Added devcontainer definition for GH Codespaces, and a README file to the /Student/Resources folder that will act as the homepage for the codespace repo * updated prereqs listed on the hack homepage --------- Co-authored-by: Peter C. Laudati <plaudati@hotmail.com>
2024-04-24 22:49:59 +03:00
* Add required credentials of above resources in `.env` file
* Install the required libraries in the `requirements.txt` file via ```pip install -r requirements.txt ``` if you have not already.
[New Hack] 066-Open AI Fundamentals (#690) * Update Challenge-02.md * Update CH-01-PromptEngineering.ipynb delete variable * Update CH-01-PromptEngineering.ipynb delete variable * Update Challenge-03.md * Typo fix Challenge-01.md * Updated comments in 4th challenge * Update Challenge-01.md * Update Challenge-01.md * Update Challenge-02.md * Move Challenge markdowns * Edit Ch 5 * Edit CH01 * Edit CH01 * Edit Challenge Markdowns * Update Challenge-01.md add more resources for tactics * Update Challenge-01.md add AOAI models link * Delete CH-01-PromptEngineering.ipynb * Add files via upload * Delete CH-03-A-Grounding.ipynb * Delete CH-03-B-Chunking.ipynb * Delete CH-03-C-Embeddings.ipynb * changed challenge 3 files * Rename CH-03-A-Grounding-v2 (1).ipynb to CH-03-A-Grounding.ipynb * Rename CH-03-B-Chunking - v2 (1).ipynb to CH-03-B-Chunking.ipynb * Rename CH-03-C-Embeddings-v2 (1).ipynb to CH-03-C-Embeddings.ipynb * Fixed md formatting * Fixed md formatting * took out the depreciated model call * Fixed md formatting * Fixed md formatting * removed depreciated model * Adding CH3 Coach guides * Rename CH-03-A-Grounding (2).ipynb to CH-03-A-Grounding-Solution.ipynb * Rename CH-03-B-Chunking (2).ipynb to CH-03-B-Chunking-Solution.ipynb * Rename CH-03-C-Embeddings (2).ipynb to CH-03-C-Embeddings-Solution.ipynb * modified CH3-A coach solutions * modified CH3-B coach solutions * modified CH3-C coach solutions * Update Challenge-00.md Removing davinci-003, added VM SKU size. * modified path * added instructions about zip file * added instructions about jupyter notebook files * Update Challenge-01.md revised description * Update Challenge-02.md revised description * Update Challenge-01.md * Update Challenge-02.md * Delete CH-02-ModelComparison.ipynb * Add files via upload * Update CH-02-ModelComparison.ipynb * Update Challenge-01.md * Update Challenge-02.md * Update README.md * Update Challenge-03.md * Update Challenge-03.md * Update Challenge-04.md * Update Challenge-04.md * Update Challenge-05.md * tookout extra notes * tookout extra notes * added cell outputs * added cell outputs * added pip installs * added pip installs * added pip installs * Update README.md * Update README.md * Update Solution-01.md * deletion of venv and renaming folder * deletion of venv * Delete Solution-07.md * Delete Solution-08.md * Delete Solution-15.md * Delete Solution-14.md * Delete Solution-13.md * Delete Solution-12.md * Delete Solution-11.md * Delete Solution-10.md * Delete Solution-09.md * Delete Solution-06.md * Delete Lectures.pptx * Update Challenge-00.md * Update Challenge-00.md * Delete Solution-04.md * Delete Solution-05.md * Update Challenge-00.md * Update Challenge-00.md * Update Challenge-01.md * Update Challenge-01.md * Update Challenge-02.md * Update Challenge-02.md * Update Challenge-02.md * Update Challenge-03.md * Update Challenge-04.md * Update Challenge-05.md * Update Challenge-00.md * added whitelisted words * Edit Coach and Main Markdowns * wordsmith readme and CH0.md * Update README.md * fixed spelling mistakes * fixed spelling mistakes * added CH3 solution md * Update Solution-01.md update md * Update Solution-02.md update guide * Update Solution-01.md * Update Solution-02.md * Update Challenge-01.md * Update Challenge-01.md * Update Challenge-02.md * Update Challenge-02.md * Update Challenge-02.md * removed extra markdown files * Update Solution-03.md Replaced <Title of Challenge> with Grounding, Chunking, and Embedding * Update Challenge-00.md Fixed typo * Update Challenge-01.md Fixed typo * Update Challenge-04.md Fixed typo * Update README.md FIxed some typos * changed ! to Note: in Challenge-01.md * added title and some additional guidance on python in Solution-00.md * added sample env file * removed suggested agenda section * added conda version and took out depreciated model mention * added project env tip for vscode * added tips to get credentials from azure portal for aoai * took out depreciated model mentions * Creating challenge 2 as optional * Creating challenge 2 as optional * Update Challenge 5 * Update CH-01-PromptEngineering-Solution.ipynb * Add files via upload * fix solution add variable * Fixed typos Fixed typos. Also changed "challenges" within a challenge to "tasks" to make it clearer * Update README.md Added WTH to the title as per standard. Wordsmithed intro paragraph to indicate this hack is designed to be self-administered vs. regular WTH format. Fixed double spacing in Learning Objectives. * Update README.md Fixed broken ToC links (used to go to /Challenges subfolder) * Update Challenge-00.md Added spaces to the hack title in the intro section. Fixed the broken links in the Common Prereq section * Update Challenge-01.md Added "back" link to C0. Added descriptive text to two hyperlinks for models. Bolded the "note" * Update Challenge-05.md Removed phantom link to C6 * Update Challenge 00-02 for clarity * Clarify .env usage in CH-01-PromptEngineering * Update .wordlist.txt * Update Challenge-00.md Updated Challenge 00 with clearer organization and temporary link to the Resources.zip file * Update Challenge-00.md Added clarification to C0 regarding set up of Jupyter Notebooks. * Update Challenge-00.md fixed typos * removed repeat instructions for requirements.txt * WTH Feedback Changes * Update Challenge-02.md * Update Challenge-02.md * Update sample-env.txt * Update Solution-02.md * update to add clarifications * Update Challenge-01.md only one model is needed, add clarification * Update Solution-01.md add instructions * Update CH-02-ModelComparison.ipynb * changed version to be set to the env variable * added ch4 feedback from participants * changed format of success criteria * changed spelling issue * Edit CH 05 and Coach Guide * changes coach md files * Update CH-01-PromptEngineering.ipynb Fixed grammar issues * Update Challenge-02.md Cleaned up location of Resource.zip file (it's covered in Challenge 00) * Update Challenge-01.md code blocked folder name * Update Challenge-03.md code blocked folder location * Update Challenge-04.md folder location in code block * Update Challenge-04.md Grammar fix in Intro paragraph * Update Challenge-05.md folder to code block * Update Solution-01.md * Update Solution-02.md Fixed title to match Challenge 2 title. * Update Solution-03.md * Update Solution-04.md Fixed incorrect challenge title & navigation links * Update Solution-04.md * Update README.md moved Suggested Hack Agenda back to its proper location in the template. * Update README.md Remove "Additional Coach Prereqs" as there are none. Cleaned up "Repo Contents" section which was referring to older folders that don't exist. * Update README.md wordsmithed Suggested Hack Agenda * Update README.md * Update README.md wordsmithed the intro section and commented out the link to lecture deck until one is posted. * Update Solution-01.md added links to the solution notebooks in the coach guide * Update Solution-01.md * Update Solution-01.md * Update Solution-02.md added links to solution guide * Update Solution-03.md added link to solution notebooks * Update Solution-04.md * Delete 066-OpenAIFundamentals/Coach/Solutions/.gitkeep removed .gitkeep folder placeholder file * Update Challenge-00.md test task list toc at start * Update Challenge-00.md * Update Challenge-00.md fixed toc anchor links * Update Challenge-00.md Edited link to Free Azure Trial * Update README.md * Update Challenge-02.md playing with formatting * Update Challenge-02.md * Update Challenge-03.md formatting * Update Challenge-04.md updated reference from "3" jupyter notebooks to 2. * Update Challenge-05.md fixed reference to single notebook * Update Challenge-05.md wordsmithed success criteria * Update README.md commented out ACS cost * Update README.md * Update CH-02-ModelComparison.ipynb Changed "Student Challenge" to "Student Task" - 4 instances * Update CH-03-A-Grounding.ipynb grammar * Update CH-03-A-Grounding.ipynb * Update CH-03-A-Grounding.ipynb made SC a top level section * Update CH-03-B-Chunking.ipynb byte pair encoding (BPE) -> full title first, followed by abbreviation. Then you can use the abbreviation for subsequent mentions. * Update CH-03-B-Chunking.ipynb removed gratuitous "so" * Update CH-03-B-Chunking.ipynb clarify where PDF file is located * Update CH-03-B-Chunking.ipynb Removed task text from section headers, promoted Success Criteria to section header ## * Update CH-03-C-Embeddings.ipynb put embedding types in bullet list. promoted Success Criteria to section header. * Update CH-03-C-Embeddings.ipynb grammar on embedding types * Update CH-03-C-Embeddings.ipynb • Student Task #1 text out of section header. • Make 3.1 Generate Embeddings for a CSV file a section header • Make Student Task #2 a section header * Update CH-03-C-Embeddings.ipynb o Clarify location of CSV file -> /Resources/data/Automobile.csv * Update CH-04-A-RAG_for_structured_data.ipynb clean up Use case section * Update CH-04-A-RAG_for_structured_data.ipynb * Update CH-04-A-RAG_for_structured_data.ipynb grammar * Update CH-04-A-RAG_for_structured_data.ipynb * Update CH-04-A-RAG_for_structured_data.ipynb fixed section header for Document Zone task * Update CH-04-A-RAG_for_structured_data.ipynb * Update CH-05-ResponsibleAI.ipynb text anchor link fix * Update CH-05-ResponsibleAI.ipynb * Update CH-05-ResponsibleAI.ipynb bolded Note on a new line * Update CH-05-ResponsibleAI.ipynb * Update CH-05-ResponsibleAI.ipynb Descriptive text for resource links * Update CH-02-ModelComparison.ipynb Fixed text for how to request quota increases. * Update CH-02-ModelComparison.ipynb added note in bold * Update CH-02-ModelComparison.ipynb * Update CH-03-C-Embeddings.ipynb Fixed grammar on Text Search Embeddings * Update CH-03-C-Embeddings.ipynb Student Challenge #2 => Student Task #2 * Update CH-03-C-Embeddings.ipynb fixed locationi * Update CH-04-A-RAG_for_structured_data.ipynb Summarizer => Text summarization * Update CH-04-A-RAG_for_structured_data.ipynb added challenge number to the title * Update CH-04-A-RAG_for_structured_data.ipynb Added a note about possibly needing to update the path * Update CH-04-A-RAG_for_structured_data.ipynb remove comment * Update CH-04-B-RAG_for_unstructured_data.ipynb added title to challenge * Update CH-04-B-RAG_for_unstructured_data.ipynb * Added additional text clarifying paths to data files and what the student goals are for each notebook in C4 * Update CH-05-ResponsibleAI.ipynb shortened title to match student guide title * try to fix anchor links again * remove broken anchor links * student challenge to student task * Fixed table errors in Notebook C2 * Updated text describing ENV file in Ch1 notebook. Depreciated -> deprecated * updated ENV language --------- Co-authored-by: melody-N07 <70866315+melody-N07@users.noreply.github.com> Co-authored-by: Devanshi Thakar <33441411+devanshithakar12@users.noreply.github.com> Co-authored-by: Alexis Joseph <36856999+alexistj@users.noreply.github.com> Co-authored-by: ShivaKumarChittamuru <shchitt@microsoft.com> Co-authored-by: Amanda Wong <wongg.amanda@gmail.com> Co-authored-by: Amanda Wong <amawong@microsoft.com> Co-authored-by: Devanshi Thakar <thakardevanshi98@gmail.com> Co-authored-by: juanlldc <125307653+juanlldc@users.noreply.github.com> Co-authored-by: Pete Rodriguez <perktime@users.noreply.github.com> Co-authored-by: wongamanda <wongamanda@uchicago.edu> Co-authored-by: Peter C. Laudati <plaudati@hotmail.com> Co-authored-by: Israel Ekpo <37992436+izzymsft@users.noreply.github.com>
2023-11-07 06:03:09 +03:00
## Introduction
When working with large language models, it is important to understand how to ground them with the right data. In addition, you will take a look at how to deal with token limits when you have a lot of data. Finally, you will experiment with embeddings. This challenge will teach you all the fundamental concepts - Grounding, Chunking, Embedding - before you see them in play in Challenge 4. Below are brief introductions to the concepts you will learn.
Grounding is a technique used when you want the model to return reliable answers to a given question.
Chunking is the process of breaking down a large document. It helps limit the amount of information we pass into the model.
An embedding is an information dense representation of the semantic meaning of a piece of text.
## Description
Questions you should be able to answer by the end of the challenge:
- Why is grounding important and how can you ground a LLM model?
- What is a token limit?
- How can you deal with token limits? What are techniques of chunking?
- What do embedding help accomplish?
[Hack Update] 066-OpenAIFundamental-v2 (#798) * added date to readme * added codespace files * added CH1 ntbk codespace changes to original * added CH3A ntbk codespace changes to original * added CH3B ntbk codespace changes to original * added CH3C ntbk codespace changes to original * added CH4A ntbk codespace changes to original * added CH4B ntbk codespace changes to original * added CH3A solutions ntbk codespace changes to original * added CH3B solutions ntbk codespace changes to original * added CH3C solutions ntbk codespace changes to original * changed CH3A solutions * deleted CH3B Solutions CS version * deleted CH3C Solutions CS version and changed CH3C Solutions file * deleted CH3A,CH3B CS versions * deleted CH4A,CH4B CS files * deleted old CS files * Update Challenge-00-CS.md resolved spelling issue ".txt" * First draft of updates to C0 to include GH codespaces * minor update to C0 * minor update to C0. Removed redundant line about "being good to go" * cleaned up redundant instructions. Referred to .env file instead of sample_env.txt. <-- may need to change this back to follow best practices for including an .env file in a Git repo * minor C0 update * update & merged C1 * updated C2 * Updated C3 to cover codespace * Updated C4 & C5 to cover codespaces. Also corrected spelling in C1 & 2 * Updated C0 to refer to .env.sample file with instructions for student to rename it * merged/renamed .env-CS (from Dec fluency event) to .env.sample added a . to .gitignore Removed old env file * renamed Notebooks folder to lowercase notebooks * Added backticks to all references to .env file in C1-C4 * Removed all "CS" files from the December event fork * cleaned up references to /notebooks folder in local workstation instructions. Removed extra characters from hack home page README.md * updated the coach guide to refer to the availability of GitHub Codespace. * added link to Codespace repo to challenge 00! * Added devcontainer definition for GH Codespaces, and a README file to the /Student/Resources folder that will act as the homepage for the codespace repo * updated prereqs listed on the hack homepage --------- Co-authored-by: Peter C. Laudati <plaudati@hotmail.com>
2024-04-24 22:49:59 +03:00
You will run the following three Jupyter notebooks for this challenge:
[New Hack] 066-Open AI Fundamentals (#690) * Update Challenge-02.md * Update CH-01-PromptEngineering.ipynb delete variable * Update CH-01-PromptEngineering.ipynb delete variable * Update Challenge-03.md * Typo fix Challenge-01.md * Updated comments in 4th challenge * Update Challenge-01.md * Update Challenge-01.md * Update Challenge-02.md * Move Challenge markdowns * Edit Ch 5 * Edit CH01 * Edit CH01 * Edit Challenge Markdowns * Update Challenge-01.md add more resources for tactics * Update Challenge-01.md add AOAI models link * Delete CH-01-PromptEngineering.ipynb * Add files via upload * Delete CH-03-A-Grounding.ipynb * Delete CH-03-B-Chunking.ipynb * Delete CH-03-C-Embeddings.ipynb * changed challenge 3 files * Rename CH-03-A-Grounding-v2 (1).ipynb to CH-03-A-Grounding.ipynb * Rename CH-03-B-Chunking - v2 (1).ipynb to CH-03-B-Chunking.ipynb * Rename CH-03-C-Embeddings-v2 (1).ipynb to CH-03-C-Embeddings.ipynb * Fixed md formatting * Fixed md formatting * took out the depreciated model call * Fixed md formatting * Fixed md formatting * removed depreciated model * Adding CH3 Coach guides * Rename CH-03-A-Grounding (2).ipynb to CH-03-A-Grounding-Solution.ipynb * Rename CH-03-B-Chunking (2).ipynb to CH-03-B-Chunking-Solution.ipynb * Rename CH-03-C-Embeddings (2).ipynb to CH-03-C-Embeddings-Solution.ipynb * modified CH3-A coach solutions * modified CH3-B coach solutions * modified CH3-C coach solutions * Update Challenge-00.md Removing davinci-003, added VM SKU size. * modified path * added instructions about zip file * added instructions about jupyter notebook files * Update Challenge-01.md revised description * Update Challenge-02.md revised description * Update Challenge-01.md * Update Challenge-02.md * Delete CH-02-ModelComparison.ipynb * Add files via upload * Update CH-02-ModelComparison.ipynb * Update Challenge-01.md * Update Challenge-02.md * Update README.md * Update Challenge-03.md * Update Challenge-03.md * Update Challenge-04.md * Update Challenge-04.md * Update Challenge-05.md * tookout extra notes * tookout extra notes * added cell outputs * added cell outputs * added pip installs * added pip installs * added pip installs * Update README.md * Update README.md * Update Solution-01.md * deletion of venv and renaming folder * deletion of venv * Delete Solution-07.md * Delete Solution-08.md * Delete Solution-15.md * Delete Solution-14.md * Delete Solution-13.md * Delete Solution-12.md * Delete Solution-11.md * Delete Solution-10.md * Delete Solution-09.md * Delete Solution-06.md * Delete Lectures.pptx * Update Challenge-00.md * Update Challenge-00.md * Delete Solution-04.md * Delete Solution-05.md * Update Challenge-00.md * Update Challenge-00.md * Update Challenge-01.md * Update Challenge-01.md * Update Challenge-02.md * Update Challenge-02.md * Update Challenge-02.md * Update Challenge-03.md * Update Challenge-04.md * Update Challenge-05.md * Update Challenge-00.md * added whitelisted words * Edit Coach and Main Markdowns * wordsmith readme and CH0.md * Update README.md * fixed spelling mistakes * fixed spelling mistakes * added CH3 solution md * Update Solution-01.md update md * Update Solution-02.md update guide * Update Solution-01.md * Update Solution-02.md * Update Challenge-01.md * Update Challenge-01.md * Update Challenge-02.md * Update Challenge-02.md * Update Challenge-02.md * removed extra markdown files * Update Solution-03.md Replaced <Title of Challenge> with Grounding, Chunking, and Embedding * Update Challenge-00.md Fixed typo * Update Challenge-01.md Fixed typo * Update Challenge-04.md Fixed typo * Update README.md FIxed some typos * changed ! to Note: in Challenge-01.md * added title and some additional guidance on python in Solution-00.md * added sample env file * removed suggested agenda section * added conda version and took out depreciated model mention * added project env tip for vscode * added tips to get credentials from azure portal for aoai * took out depreciated model mentions * Creating challenge 2 as optional * Creating challenge 2 as optional * Update Challenge 5 * Update CH-01-PromptEngineering-Solution.ipynb * Add files via upload * fix solution add variable * Fixed typos Fixed typos. Also changed "challenges" within a challenge to "tasks" to make it clearer * Update README.md Added WTH to the title as per standard. Wordsmithed intro paragraph to indicate this hack is designed to be self-administered vs. regular WTH format. Fixed double spacing in Learning Objectives. * Update README.md Fixed broken ToC links (used to go to /Challenges subfolder) * Update Challenge-00.md Added spaces to the hack title in the intro section. Fixed the broken links in the Common Prereq section * Update Challenge-01.md Added "back" link to C0. Added descriptive text to two hyperlinks for models. Bolded the "note" * Update Challenge-05.md Removed phantom link to C6 * Update Challenge 00-02 for clarity * Clarify .env usage in CH-01-PromptEngineering * Update .wordlist.txt * Update Challenge-00.md Updated Challenge 00 with clearer organization and temporary link to the Resources.zip file * Update Challenge-00.md Added clarification to C0 regarding set up of Jupyter Notebooks. * Update Challenge-00.md fixed typos * removed repeat instructions for requirements.txt * WTH Feedback Changes * Update Challenge-02.md * Update Challenge-02.md * Update sample-env.txt * Update Solution-02.md * update to add clarifications * Update Challenge-01.md only one model is needed, add clarification * Update Solution-01.md add instructions * Update CH-02-ModelComparison.ipynb * changed version to be set to the env variable * added ch4 feedback from participants * changed format of success criteria * changed spelling issue * Edit CH 05 and Coach Guide * changes coach md files * Update CH-01-PromptEngineering.ipynb Fixed grammar issues * Update Challenge-02.md Cleaned up location of Resource.zip file (it's covered in Challenge 00) * Update Challenge-01.md code blocked folder name * Update Challenge-03.md code blocked folder location * Update Challenge-04.md folder location in code block * Update Challenge-04.md Grammar fix in Intro paragraph * Update Challenge-05.md folder to code block * Update Solution-01.md * Update Solution-02.md Fixed title to match Challenge 2 title. * Update Solution-03.md * Update Solution-04.md Fixed incorrect challenge title & navigation links * Update Solution-04.md * Update README.md moved Suggested Hack Agenda back to its proper location in the template. * Update README.md Remove "Additional Coach Prereqs" as there are none. Cleaned up "Repo Contents" section which was referring to older folders that don't exist. * Update README.md wordsmithed Suggested Hack Agenda * Update README.md * Update README.md wordsmithed the intro section and commented out the link to lecture deck until one is posted. * Update Solution-01.md added links to the solution notebooks in the coach guide * Update Solution-01.md * Update Solution-01.md * Update Solution-02.md added links to solution guide * Update Solution-03.md added link to solution notebooks * Update Solution-04.md * Delete 066-OpenAIFundamentals/Coach/Solutions/.gitkeep removed .gitkeep folder placeholder file * Update Challenge-00.md test task list toc at start * Update Challenge-00.md * Update Challenge-00.md fixed toc anchor links * Update Challenge-00.md Edited link to Free Azure Trial * Update README.md * Update Challenge-02.md playing with formatting * Update Challenge-02.md * Update Challenge-03.md formatting * Update Challenge-04.md updated reference from "3" jupyter notebooks to 2. * Update Challenge-05.md fixed reference to single notebook * Update Challenge-05.md wordsmithed success criteria * Update README.md commented out ACS cost * Update README.md * Update CH-02-ModelComparison.ipynb Changed "Student Challenge" to "Student Task" - 4 instances * Update CH-03-A-Grounding.ipynb grammar * Update CH-03-A-Grounding.ipynb * Update CH-03-A-Grounding.ipynb made SC a top level section * Update CH-03-B-Chunking.ipynb byte pair encoding (BPE) -> full title first, followed by abbreviation. Then you can use the abbreviation for subsequent mentions. * Update CH-03-B-Chunking.ipynb removed gratuitous "so" * Update CH-03-B-Chunking.ipynb clarify where PDF file is located * Update CH-03-B-Chunking.ipynb Removed task text from section headers, promoted Success Criteria to section header ## * Update CH-03-C-Embeddings.ipynb put embedding types in bullet list. promoted Success Criteria to section header. * Update CH-03-C-Embeddings.ipynb grammar on embedding types * Update CH-03-C-Embeddings.ipynb • Student Task #1 text out of section header. • Make 3.1 Generate Embeddings for a CSV file a section header • Make Student Task #2 a section header * Update CH-03-C-Embeddings.ipynb o Clarify location of CSV file -> /Resources/data/Automobile.csv * Update CH-04-A-RAG_for_structured_data.ipynb clean up Use case section * Update CH-04-A-RAG_for_structured_data.ipynb * Update CH-04-A-RAG_for_structured_data.ipynb grammar * Update CH-04-A-RAG_for_structured_data.ipynb * Update CH-04-A-RAG_for_structured_data.ipynb fixed section header for Document Zone task * Update CH-04-A-RAG_for_structured_data.ipynb * Update CH-05-ResponsibleAI.ipynb text anchor link fix * Update CH-05-ResponsibleAI.ipynb * Update CH-05-ResponsibleAI.ipynb bolded Note on a new line * Update CH-05-ResponsibleAI.ipynb * Update CH-05-ResponsibleAI.ipynb Descriptive text for resource links * Update CH-02-ModelComparison.ipynb Fixed text for how to request quota increases. * Update CH-02-ModelComparison.ipynb added note in bold * Update CH-02-ModelComparison.ipynb * Update CH-03-C-Embeddings.ipynb Fixed grammar on Text Search Embeddings * Update CH-03-C-Embeddings.ipynb Student Challenge #2 => Student Task #2 * Update CH-03-C-Embeddings.ipynb fixed locationi * Update CH-04-A-RAG_for_structured_data.ipynb Summarizer => Text summarization * Update CH-04-A-RAG_for_structured_data.ipynb added challenge number to the title * Update CH-04-A-RAG_for_structured_data.ipynb Added a note about possibly needing to update the path * Update CH-04-A-RAG_for_structured_data.ipynb remove comment * Update CH-04-B-RAG_for_unstructured_data.ipynb added title to challenge * Update CH-04-B-RAG_for_unstructured_data.ipynb * Added additional text clarifying paths to data files and what the student goals are for each notebook in C4 * Update CH-05-ResponsibleAI.ipynb shortened title to match student guide title * try to fix anchor links again * remove broken anchor links * student challenge to student task * Fixed table errors in Notebook C2 * Updated text describing ENV file in Ch1 notebook. Depreciated -> deprecated * updated ENV language --------- Co-authored-by: melody-N07 <70866315+melody-N07@users.noreply.github.com> Co-authored-by: Devanshi Thakar <33441411+devanshithakar12@users.noreply.github.com> Co-authored-by: Alexis Joseph <36856999+alexistj@users.noreply.github.com> Co-authored-by: ShivaKumarChittamuru <shchitt@microsoft.com> Co-authored-by: Amanda Wong <wongg.amanda@gmail.com> Co-authored-by: Amanda Wong <amawong@microsoft.com> Co-authored-by: Devanshi Thakar <thakardevanshi98@gmail.com> Co-authored-by: juanlldc <125307653+juanlldc@users.noreply.github.com> Co-authored-by: Pete Rodriguez <perktime@users.noreply.github.com> Co-authored-by: wongamanda <wongamanda@uchicago.edu> Co-authored-by: Peter C. Laudati <plaudati@hotmail.com> Co-authored-by: Israel Ekpo <37992436+izzymsft@users.noreply.github.com>
2023-11-07 06:03:09 +03:00
* `CH-03-A-Grounding.ipynb`
* `CH-03-B-Chunking.ipynb`
* `CH-03-C-Embeddings.ipynb`
[Hack Update] 066-OpenAIFundamental-v2 (#798) * added date to readme * added codespace files * added CH1 ntbk codespace changes to original * added CH3A ntbk codespace changes to original * added CH3B ntbk codespace changes to original * added CH3C ntbk codespace changes to original * added CH4A ntbk codespace changes to original * added CH4B ntbk codespace changes to original * added CH3A solutions ntbk codespace changes to original * added CH3B solutions ntbk codespace changes to original * added CH3C solutions ntbk codespace changes to original * changed CH3A solutions * deleted CH3B Solutions CS version * deleted CH3C Solutions CS version and changed CH3C Solutions file * deleted CH3A,CH3B CS versions * deleted CH4A,CH4B CS files * deleted old CS files * Update Challenge-00-CS.md resolved spelling issue ".txt" * First draft of updates to C0 to include GH codespaces * minor update to C0 * minor update to C0. Removed redundant line about "being good to go" * cleaned up redundant instructions. Referred to .env file instead of sample_env.txt. <-- may need to change this back to follow best practices for including an .env file in a Git repo * minor C0 update * update & merged C1 * updated C2 * Updated C3 to cover codespace * Updated C4 & C5 to cover codespaces. Also corrected spelling in C1 & 2 * Updated C0 to refer to .env.sample file with instructions for student to rename it * merged/renamed .env-CS (from Dec fluency event) to .env.sample added a . to .gitignore Removed old env file * renamed Notebooks folder to lowercase notebooks * Added backticks to all references to .env file in C1-C4 * Removed all "CS" files from the December event fork * cleaned up references to /notebooks folder in local workstation instructions. Removed extra characters from hack home page README.md * updated the coach guide to refer to the availability of GitHub Codespace. * added link to Codespace repo to challenge 00! * Added devcontainer definition for GH Codespaces, and a README file to the /Student/Resources folder that will act as the homepage for the codespace repo * updated prereqs listed on the hack homepage --------- Co-authored-by: Peter C. Laudati <plaudati@hotmail.com>
2024-04-24 22:49:59 +03:00
These files can be found in your Codespace under the `/notebooks` folder.
If you are working locally or in the Cloud, you can find them in the `/notebooks` folder of `Resources.zip` file.
To run a Jupyter notebook, navigate to it in your Codespace or open it in VS Code on your local workstation. You will find further instructions for the challenge, as well as in-line code blocks that you will interact with to complete the tasks for the challenge. Return here to the student guide after completing all tasks in the Jupyter notebook to validate you have met the [success criteria](#success-criteria) below for this challenge.
[New Hack] 066-Open AI Fundamentals (#690) * Update Challenge-02.md * Update CH-01-PromptEngineering.ipynb delete variable * Update CH-01-PromptEngineering.ipynb delete variable * Update Challenge-03.md * Typo fix Challenge-01.md * Updated comments in 4th challenge * Update Challenge-01.md * Update Challenge-01.md * Update Challenge-02.md * Move Challenge markdowns * Edit Ch 5 * Edit CH01 * Edit CH01 * Edit Challenge Markdowns * Update Challenge-01.md add more resources for tactics * Update Challenge-01.md add AOAI models link * Delete CH-01-PromptEngineering.ipynb * Add files via upload * Delete CH-03-A-Grounding.ipynb * Delete CH-03-B-Chunking.ipynb * Delete CH-03-C-Embeddings.ipynb * changed challenge 3 files * Rename CH-03-A-Grounding-v2 (1).ipynb to CH-03-A-Grounding.ipynb * Rename CH-03-B-Chunking - v2 (1).ipynb to CH-03-B-Chunking.ipynb * Rename CH-03-C-Embeddings-v2 (1).ipynb to CH-03-C-Embeddings.ipynb * Fixed md formatting * Fixed md formatting * took out the depreciated model call * Fixed md formatting * Fixed md formatting * removed depreciated model * Adding CH3 Coach guides * Rename CH-03-A-Grounding (2).ipynb to CH-03-A-Grounding-Solution.ipynb * Rename CH-03-B-Chunking (2).ipynb to CH-03-B-Chunking-Solution.ipynb * Rename CH-03-C-Embeddings (2).ipynb to CH-03-C-Embeddings-Solution.ipynb * modified CH3-A coach solutions * modified CH3-B coach solutions * modified CH3-C coach solutions * Update Challenge-00.md Removing davinci-003, added VM SKU size. * modified path * added instructions about zip file * added instructions about jupyter notebook files * Update Challenge-01.md revised description * Update Challenge-02.md revised description * Update Challenge-01.md * Update Challenge-02.md * Delete CH-02-ModelComparison.ipynb * Add files via upload * Update CH-02-ModelComparison.ipynb * Update Challenge-01.md * Update Challenge-02.md * Update README.md * Update Challenge-03.md * Update Challenge-03.md * Update Challenge-04.md * Update Challenge-04.md * Update Challenge-05.md * tookout extra notes * tookout extra notes * added cell outputs * added cell outputs * added pip installs * added pip installs * added pip installs * Update README.md * Update README.md * Update Solution-01.md * deletion of venv and renaming folder * deletion of venv * Delete Solution-07.md * Delete Solution-08.md * Delete Solution-15.md * Delete Solution-14.md * Delete Solution-13.md * Delete Solution-12.md * Delete Solution-11.md * Delete Solution-10.md * Delete Solution-09.md * Delete Solution-06.md * Delete Lectures.pptx * Update Challenge-00.md * Update Challenge-00.md * Delete Solution-04.md * Delete Solution-05.md * Update Challenge-00.md * Update Challenge-00.md * Update Challenge-01.md * Update Challenge-01.md * Update Challenge-02.md * Update Challenge-02.md * Update Challenge-02.md * Update Challenge-03.md * Update Challenge-04.md * Update Challenge-05.md * Update Challenge-00.md * added whitelisted words * Edit Coach and Main Markdowns * wordsmith readme and CH0.md * Update README.md * fixed spelling mistakes * fixed spelling mistakes * added CH3 solution md * Update Solution-01.md update md * Update Solution-02.md update guide * Update Solution-01.md * Update Solution-02.md * Update Challenge-01.md * Update Challenge-01.md * Update Challenge-02.md * Update Challenge-02.md * Update Challenge-02.md * removed extra markdown files * Update Solution-03.md Replaced <Title of Challenge> with Grounding, Chunking, and Embedding * Update Challenge-00.md Fixed typo * Update Challenge-01.md Fixed typo * Update Challenge-04.md Fixed typo * Update README.md FIxed some typos * changed ! to Note: in Challenge-01.md * added title and some additional guidance on python in Solution-00.md * added sample env file * removed suggested agenda section * added conda version and took out depreciated model mention * added project env tip for vscode * added tips to get credentials from azure portal for aoai * took out depreciated model mentions * Creating challenge 2 as optional * Creating challenge 2 as optional * Update Challenge 5 * Update CH-01-PromptEngineering-Solution.ipynb * Add files via upload * fix solution add variable * Fixed typos Fixed typos. Also changed "challenges" within a challenge to "tasks" to make it clearer * Update README.md Added WTH to the title as per standard. Wordsmithed intro paragraph to indicate this hack is designed to be self-administered vs. regular WTH format. Fixed double spacing in Learning Objectives. * Update README.md Fixed broken ToC links (used to go to /Challenges subfolder) * Update Challenge-00.md Added spaces to the hack title in the intro section. Fixed the broken links in the Common Prereq section * Update Challenge-01.md Added "back" link to C0. Added descriptive text to two hyperlinks for models. Bolded the "note" * Update Challenge-05.md Removed phantom link to C6 * Update Challenge 00-02 for clarity * Clarify .env usage in CH-01-PromptEngineering * Update .wordlist.txt * Update Challenge-00.md Updated Challenge 00 with clearer organization and temporary link to the Resources.zip file * Update Challenge-00.md Added clarification to C0 regarding set up of Jupyter Notebooks. * Update Challenge-00.md fixed typos * removed repeat instructions for requirements.txt * WTH Feedback Changes * Update Challenge-02.md * Update Challenge-02.md * Update sample-env.txt * Update Solution-02.md * update to add clarifications * Update Challenge-01.md only one model is needed, add clarification * Update Solution-01.md add instructions * Update CH-02-ModelComparison.ipynb * changed version to be set to the env variable * added ch4 feedback from participants * changed format of success criteria * changed spelling issue * Edit CH 05 and Coach Guide * changes coach md files * Update CH-01-PromptEngineering.ipynb Fixed grammar issues * Update Challenge-02.md Cleaned up location of Resource.zip file (it's covered in Challenge 00) * Update Challenge-01.md code blocked folder name * Update Challenge-03.md code blocked folder location * Update Challenge-04.md folder location in code block * Update Challenge-04.md Grammar fix in Intro paragraph * Update Challenge-05.md folder to code block * Update Solution-01.md * Update Solution-02.md Fixed title to match Challenge 2 title. * Update Solution-03.md * Update Solution-04.md Fixed incorrect challenge title & navigation links * Update Solution-04.md * Update README.md moved Suggested Hack Agenda back to its proper location in the template. * Update README.md Remove "Additional Coach Prereqs" as there are none. Cleaned up "Repo Contents" section which was referring to older folders that don't exist. * Update README.md wordsmithed Suggested Hack Agenda * Update README.md * Update README.md wordsmithed the intro section and commented out the link to lecture deck until one is posted. * Update Solution-01.md added links to the solution notebooks in the coach guide * Update Solution-01.md * Update Solution-01.md * Update Solution-02.md added links to solution guide * Update Solution-03.md added link to solution notebooks * Update Solution-04.md * Delete 066-OpenAIFundamentals/Coach/Solutions/.gitkeep removed .gitkeep folder placeholder file * Update Challenge-00.md test task list toc at start * Update Challenge-00.md * Update Challenge-00.md fixed toc anchor links * Update Challenge-00.md Edited link to Free Azure Trial * Update README.md * Update Challenge-02.md playing with formatting * Update Challenge-02.md * Update Challenge-03.md formatting * Update Challenge-04.md updated reference from "3" jupyter notebooks to 2. * Update Challenge-05.md fixed reference to single notebook * Update Challenge-05.md wordsmithed success criteria * Update README.md commented out ACS cost * Update README.md * Update CH-02-ModelComparison.ipynb Changed "Student Challenge" to "Student Task" - 4 instances * Update CH-03-A-Grounding.ipynb grammar * Update CH-03-A-Grounding.ipynb * Update CH-03-A-Grounding.ipynb made SC a top level section * Update CH-03-B-Chunking.ipynb byte pair encoding (BPE) -> full title first, followed by abbreviation. Then you can use the abbreviation for subsequent mentions. * Update CH-03-B-Chunking.ipynb removed gratuitous "so" * Update CH-03-B-Chunking.ipynb clarify where PDF file is located * Update CH-03-B-Chunking.ipynb Removed task text from section headers, promoted Success Criteria to section header ## * Update CH-03-C-Embeddings.ipynb put embedding types in bullet list. promoted Success Criteria to section header. * Update CH-03-C-Embeddings.ipynb grammar on embedding types * Update CH-03-C-Embeddings.ipynb • Student Task #1 text out of section header. • Make 3.1 Generate Embeddings for a CSV file a section header • Make Student Task #2 a section header * Update CH-03-C-Embeddings.ipynb o Clarify location of CSV file -> /Resources/data/Automobile.csv * Update CH-04-A-RAG_for_structured_data.ipynb clean up Use case section * Update CH-04-A-RAG_for_structured_data.ipynb * Update CH-04-A-RAG_for_structured_data.ipynb grammar * Update CH-04-A-RAG_for_structured_data.ipynb * Update CH-04-A-RAG_for_structured_data.ipynb fixed section header for Document Zone task * Update CH-04-A-RAG_for_structured_data.ipynb * Update CH-05-ResponsibleAI.ipynb text anchor link fix * Update CH-05-ResponsibleAI.ipynb * Update CH-05-ResponsibleAI.ipynb bolded Note on a new line * Update CH-05-ResponsibleAI.ipynb * Update CH-05-ResponsibleAI.ipynb Descriptive text for resource links * Update CH-02-ModelComparison.ipynb Fixed text for how to request quota increases. * Update CH-02-ModelComparison.ipynb added note in bold * Update CH-02-ModelComparison.ipynb * Update CH-03-C-Embeddings.ipynb Fixed grammar on Text Search Embeddings * Update CH-03-C-Embeddings.ipynb Student Challenge #2 => Student Task #2 * Update CH-03-C-Embeddings.ipynb fixed locationi * Update CH-04-A-RAG_for_structured_data.ipynb Summarizer => Text summarization * Update CH-04-A-RAG_for_structured_data.ipynb added challenge number to the title * Update CH-04-A-RAG_for_structured_data.ipynb Added a note about possibly needing to update the path * Update CH-04-A-RAG_for_structured_data.ipynb remove comment * Update CH-04-B-RAG_for_unstructured_data.ipynb added title to challenge * Update CH-04-B-RAG_for_unstructured_data.ipynb * Added additional text clarifying paths to data files and what the student goals are for each notebook in C4 * Update CH-05-ResponsibleAI.ipynb shortened title to match student guide title * try to fix anchor links again * remove broken anchor links * student challenge to student task * Fixed table errors in Notebook C2 * Updated text describing ENV file in Ch1 notebook. Depreciated -> deprecated * updated ENV language --------- Co-authored-by: melody-N07 <70866315+melody-N07@users.noreply.github.com> Co-authored-by: Devanshi Thakar <33441411+devanshithakar12@users.noreply.github.com> Co-authored-by: Alexis Joseph <36856999+alexistj@users.noreply.github.com> Co-authored-by: ShivaKumarChittamuru <shchitt@microsoft.com> Co-authored-by: Amanda Wong <wongg.amanda@gmail.com> Co-authored-by: Amanda Wong <amawong@microsoft.com> Co-authored-by: Devanshi Thakar <thakardevanshi98@gmail.com> Co-authored-by: juanlldc <125307653+juanlldc@users.noreply.github.com> Co-authored-by: Pete Rodriguez <perktime@users.noreply.github.com> Co-authored-by: wongamanda <wongamanda@uchicago.edu> Co-authored-by: Peter C. Laudati <plaudati@hotmail.com> Co-authored-by: Israel Ekpo <37992436+izzymsft@users.noreply.github.com>
2023-11-07 06:03:09 +03:00
## Success Criteria
To complete this challenge successfully, you should be able to:
- Verify that you are able to ground a model through the system message
- Demonstrate various chunking techniques
- Demonstrate how to create embeddings
## Additional Resources
* [Grounding LLMs](https://techcommunity.microsoft.com/t5/fasttrack-for-azure/grounding-llms/ba-p/3843857)
* [Embeddings example](https://github.com/openai/openai-cookbook/blob/main/examples/Embedding_Wikipedia_articles_for_search.ipynb)
* [Langchain Chunking](https://js.langchain.com/docs/modules/indexes/text_splitters/examples/recursive_character)