Skip to content

Commit fd7151e

Browse files
nagkumar91Nagkumar ArkalgudNagkumar Arkalgud
authored andcommitted
azure-ai-evaluation release 1.0.0b5 2024-10-28 (Azure#38138)
* Update task_query_response.prompty remove required keys * Update task_simulate.prompty * Update task_query_response.prompty * Update task_simulate.prompty * Fix the api_key needed * Update for release * Black fix for file * Add original text in global context * Update test * Update the indirect attack simulator * Black suggested fixes * Update simulator prompty * Update adversarial scenario enum to exclude XPIA * Update changelog * Black fixes * Remove duplicate import * Fix the mypy error * Mypy please be happy * Updates to non adv simulator * accept context from assistant messages, exclude them when using them for conversation * update changelog * pylint fixes * pylint fixes * remove redundant quotes * Fix typo * pylint fix * Update broken tests * Include the grounding json in the manifest * Fix typo * Come on package * Release 1.0.0b5 * Notice from Chang * Remove adv_conv template parameters from the outputs --------- Co-authored-by: Nagkumar Arkalgud <[email protected]> Co-authored-by: Nagkumar Arkalgud <[email protected]>
1 parent 7e130e0 commit fd7151e

File tree

3 files changed

+24
-1
lines changed

3 files changed

+24
-1
lines changed

sdk/evaluation/azure-ai-evaluation/CHANGELOG.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
# Release History
22

3-
## 1.0.0b5 (Unreleased)
3+
## 1.0.0b5 (2024-10-28)
44

55
### Features Added
66
- Added `GroundednessProEvaluator`, which is a service-based evaluator for determining response groundedness.

sdk/evaluation/azure-ai-evaluation/NOTICE.txt

Lines changed: 20 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -48,3 +48,23 @@ distributed under the License is distributed on an "AS IS" BASIS,
4848
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
4949
See the License for the specific language governing permissions and
5050
limitations under the License.
51+
52+
53+
License notice for [Is GPT-4 a reliable rater? Evaluating consistency in GPT-4's text ratings](https://www.frontiersin.org/journals/education/articles/10.3389/feduc.2023.1272229/full)
54+
------------------------------------------------------------------------------------------------------------------
55+
Copyright © 2023 Hackl, Müller, Granitzer and Sailer. This work is openly licensed via [CC BY 4.0](http://creativecommons.org/licenses/by/4.0/).
56+
57+
58+
License notice for [Is ChatGPT a Good NLG Evaluator? A Preliminary Study](https://aclanthology.org/2023.newsum-1.1) (Wang et al., NewSum 2023)
59+
------------------------------------------------------------------------------------------------------------------
60+
Copyright © 2023. This work is openly licensed via [CC BY 4.0](http://creativecommons.org/licenses/by/4.0/).
61+
62+
63+
License notice for [SummEval: Re-evaluating Summarization Evaluation.](https://doi.org/10.1162/tacl_a_00373) (Fabbri et al.)
64+
------------------------------------------------------------------------------------------------------------------
65+
© 2021 Association for Computational Linguistics. This work is openly licensed via [CC BY 4.0](http://creativecommons.org/licenses/by/4.0/).
66+
67+
68+
License notice for [Evaluation Metrics in the Era of GPT-4: Reliably Evaluating Large Language Models on Sequence to Sequence Tasks](https://aclanthology.org/2023.emnlp-main.543) (Sottana et al., EMNLP 2023)
69+
------------------------------------------------------------------------------------------------------------------
70+
© 2023 Association for Computational Linguistics. This work is openly licensed via [CC BY 4.0](http://creativecommons.org/licenses/by/4.0/).

sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/simulator/_adversarial_simulator.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -276,6 +276,9 @@ def _to_chat_protocol(
276276
"target_population",
277277
"topic",
278278
"ch_template_placeholder",
279+
"chatbot_name",
280+
"name",
281+
"group",
279282
):
280283
template_parameters.pop(key, None)
281284
if conversation_category:

0 commit comments

Comments
 (0)