Граф коммитов

2931 Коммитов

Автор SHA1 Сообщение Дата
Tarek Mahmoud Sayed d0aa2c2461
Address the feedback on the tokenizer's library (#7024)
* Fix cache when calling EncodeToIds

* Make EnglishRoberta _mergeRanks thread safe

* Delete Trainer

* Remove the setters on the Bpe properties

* Remove Roberta and Tiktoken special casing in the Tokenizer and support the cases in the Model abstraction

* Support text-embedding-3-small/large embedding

* Remove redundant TokenToId abstraction and keep the one with the extra parameters

* Enable creating Tiktoken asynchronously or directly using the tokenizer data

* Add cancellationToken support in CreateAsync APIs

* Rename sequence to text and Tokenize to Encode

* Rename skipSpecialTokens to considerSpecialTokens

* Rename TokenizerResult to EncodingResult

* Make Token publicly immutable

* Change offset tuples from (Index, End) to (Index, Length)

* Rename NormalizedString method's parameters

* Rename Model's methods to start with verb

* Convert  Model.GetVocab() method to a Vocab property

* Some method's parameters and variable renaming

* Remove Vocab and VocabSize from the abstraction

* Cleanup normalization support

* Minor Bpe cleanup

* Resolve rebase change

* Address the feedback
2024-02-26 10:57:18 -08:00
Stephen Toub 4b89d98760
Optimize regexes used in tiktoken (#7020)
* Optimize regexes used in tiktoken

* Add comment and consolidate duplicate regex from Roberta
2024-02-22 16:41:24 -08:00
Michael Sharp a13937153d
fix formatting that fails in VS (#7023) 2024-02-22 14:49:28 -07:00
Michael Sharp 0d2fd603a2
Temp fix for the race condition during the tests. (#7021)
* temp fixing the race condition during the tests

* making db tests sequential
2024-02-21 13:14:45 -07:00
Michael Sharp 4d4f7dc1ec
Update OnnxRuntime to 1.16.3 (#6975)
* onnx update

* formatting update

* adding onnx runtime reference to automl tests
2024-02-21 11:58:38 -07:00
zewditu Hailemariam a1b7e1d249
match AutoMl names with Ml.net names (#7007) 2024-02-21 01:36:18 -07:00
Stephen Toub 2c9f775d06
Tweak CreateByModelNameAsync (#7015)
- Add a CancellationToken to CreateByModelNameAsync, allowing the download and parsing to be canceled.
- Use ReadLineAsync(cancellationToken), which not only allows it to be canceled, but avoids ~100K task allocations
- Fix Helpers.FromBase64String to support lines longer than 300 chars
2024-02-20 12:08:18 -08:00
Stephen Toub 3282f44853
Tweak Tiktoken's BytePairEncode for improved perf (#7017)
- Stackalloc the indices/ranks when feasible
- Use a span to eliminate bounds checks and allow for directly updating ranks
2024-02-20 11:02:04 -08:00
Stephen Toub eb66d731d4
Avoid LruCache in Tiktoken when cacheSize specified is 0 (#7016) 2024-02-20 10:34:11 -08:00
Stephen Toub f97642412a
First round of perf improvements for tiktoken (#7012) 2024-02-18 12:28:17 -08:00
Tarek Mahmoud Sayed 4635a862dd
Tokenizer's Interfaces Cleanup (#7001)
* Tokenizer's Interfaces Cleanup

* Address the feedback

* Optimization
2024-02-16 12:48:38 -07:00
zewditu Hailemariam 64523e8aa7
Add text normalizer transformer to AutoML (#6998)
* Add text normalizer transformer to AutoML

* clean

* clean

* Add  default

* Follow default pattern of ML.Net
2024-02-13 15:41:41 -08:00
dotnet-policy-service[bot] e3a06e354a
FabricBot: Onboarding to GitOps.ResourceManagement because of FabricBot decommissioning (#6983)
* Add prIssueManagement.yml to onboard repo to GitOps.ResourceManagement as FabricBot replacement

 Details on the replacement service and the syntax of the new yaml configuration file is available publicly at: https://microsoft.github.io/GitOps/policies/resource-management.html

Please review and merge this PR to complete the process of onboarding to the new service.

* Deleting fabricbot.json

* De-Dupe the Write permission check

* Add automation for adding/removing the untriaged label to issues

---------

Co-authored-by: dotnet-policy-service[bot] <123482357+dotnet-policy-service[bot]@users.noreply.github.com>
Co-authored-by: Jeff Handley <jeffhandley@users.noreply.github.com>
2024-02-12 08:37:15 -08:00
dotnet-maestro[bot] 96e2692eaf
Update dependencies from https://github.com/dotnet/arcade build 20240202.4 (#6985)
Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XliffTasks , Microsoft.DotNet.XUnitExtensions
 From Version 9.0.0-beta.24076.5 -> To Version 9.0.0-beta.24102.4

Co-authored-by: dotnet-maestro[bot] <dotnet-maestro[bot]@users.noreply.github.com>
2024-02-07 11:58:00 -08:00
Tarek Mahmoud Sayed 6f55525602
Introducing Tiktoken Tokenizer (#6981)
* Introducing Tiktoken Tokenizer

* Address the feedback

* file renaming
2024-02-06 11:13:28 -08:00
dotnet-maestro[bot] 902102e23d
Update dependencies from https://github.com/dotnet/arcade build 20240126.5 (#6976)
Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XliffTasks , Microsoft.DotNet.XUnitExtensions
 From Version 9.0.0-beta.24069.2 -> To Version 9.0.0-beta.24076.5

Co-authored-by: dotnet-maestro[bot] <dotnet-maestro[bot]@users.noreply.github.com>
2024-01-30 11:36:24 -08:00
zewditu Hailemariam 4d69110150
Add new type to key-value converter (#6973)
* Add new type to key-value

* Make it IDataView

* clean

* add dataview support in source generator

* Update type_converter_search_space.json

* Update search-space-schema.json

---------

Co-authored-by: XiaoYun Zhang <xiaoyuz@microsoft.com>
Co-authored-by: Xiaoyun Zhang <bigmiao.zhang@gmail.com>
2024-01-26 15:49:31 -07:00
dotnet-maestro[bot] 4c44e927bc
[main] Update dependencies from dotnet/arcade (#6969)
* Update dependencies from https://github.com/dotnet/arcade build 20240119.2

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XliffTasks , Microsoft.DotNet.XUnitExtensions
 From Version 9.0.0-beta.24062.5 -> To Version 9.0.0-beta.24069.2

* Prefer XUnitVersion property from arcade

---------

Co-authored-by: dotnet-maestro[bot] <dotnet-maestro[bot]@users.noreply.github.com>
Co-authored-by: Eric StJohn <ericstj@microsoft.com>
2024-01-24 13:14:24 -08:00
Michael Sharp 54fa44fbf8
testing light gbm tests sequentially (#6968) 2024-01-23 10:32:40 -07:00
zewditu Hailemariam 125b6d5d3f
Add sweepable estimator to NER (#6965) 2024-01-19 12:36:36 -07:00
Eric StJohn 48b6fbed7a
Update .NET Runtimes to latest version (#6964) 2024-01-19 11:32:29 -08:00
Michael Sharp 5e28578ef7
added in release notes for 3.0.1 (#6962) 2024-01-18 11:00:06 -07:00
dotnet-maestro[bot] e46b80889c
Update dependencies from https://github.com/dotnet/arcade build 20240112.5 (#6957)
Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XliffTasks , Microsoft.DotNet.XUnitExtensions
 From Version 9.0.0-beta.23628.1 -> To Version 9.0.0-beta.24062.5

Co-authored-by: dotnet-maestro[bot] <dotnet-maestro[bot]@users.noreply.github.com>
2024-01-17 14:51:50 -07:00
Michael Sharp b824ed2a09
Torch sharp version updates and test fixes (#6954)
* torch sharp version updates and test fixes

* removed extra test file

* job templat updated for test coverage
2024-01-16 20:59:17 -07:00
Xiaoyun Zhang ac6d130491
fix #6949 (#6951)
* fix #6949

* add MoveForwardToAttribute to assembly.cs and fix search space sub namespace
2024-01-10 13:02:30 -07:00
Eric StJohn d0d8569220
Add Backport github workflow (#6944) 2024-01-08 13:15:51 -08:00
Michael Sharp 9f4a389891
Split out non concurrent test collections. (#6937)
* Split out non concurrent test collections.

* Fix Core Tests

* removed torchsharp and tensorflow tests when not x64

* fixes from pr comments

* removing unnecessary line from the proj file
2024-01-08 10:00:49 -07:00
Michael Sharp 8896dd2927
Fixes NER to correctly expand/shrink the labels (#6928)
* ner options fix

* Ner fixed.

* Update src/Microsoft.ML.Tokenizers/Model/EnglishRoberta.cs

Co-authored-by: Eric StJohn <ericstj@microsoft.com>

* fixes from PR comments

* fixed build

---------

Co-authored-by: Eric StJohn <ericstj@microsoft.com>
2024-01-05 17:52:17 -07:00
Eric StJohn 373a86467c
Only use semi-colons for NoWarn (#6935) 2024-01-02 18:02:13 -08:00
dotnet-maestro[bot] f625080a07
[main] Update dependencies from dotnet/arcade (#6703)
* Update dependencies from https://github.com/dotnet/arcade build 20230519.2

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 8.0.0-beta.23269.2

* Add dotnet8 nuget feed

* Update dependencies from https://github.com/dotnet/arcade build 20230529.1

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 8.0.0-beta.23279.1

* Update dependencies from https://github.com/dotnet/arcade build 20230602.3

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 8.0.0-beta.23302.3

* Update dependencies from https://github.com/dotnet/arcade build 20230609.8

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 8.0.0-beta.23309.8

* Update dependencies from https://github.com/dotnet/arcade build 20230616.6

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 8.0.0-beta.23316.6

* Update dependencies from https://github.com/dotnet/arcade build 20230622.2

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 8.0.0-beta.23322.2

* Update dependencies from https://github.com/dotnet/arcade build 20230630.1

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 8.0.0-beta.23330.1

* Update dependencies from https://github.com/dotnet/arcade build 20230710.1

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 8.0.0-beta.23360.1

* Update dependencies from https://github.com/dotnet/arcade build 20230714.2

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 8.0.0-beta.23364.2

* Update dependencies from https://github.com/dotnet/arcade build 20230721.1

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 8.0.0-beta.23371.1

* Update dependencies from https://github.com/dotnet/arcade build 20230728.2

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 8.0.0-beta.23378.2

* Update dependencies from https://github.com/dotnet/arcade build 20230804.2

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 8.0.0-beta.23404.2

* Update dependencies from https://github.com/dotnet/arcade build 20230811.1

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 8.0.0-beta.23411.1

* Update dependencies from https://github.com/dotnet/arcade build 20230819.1

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 8.0.0-beta.23419.1

* Update dependencies from https://github.com/dotnet/arcade build 20230825.2

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 8.0.0-beta.23425.2

* Update dependencies from https://github.com/dotnet/arcade build 20230901.1

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 8.0.0-beta.23451.1

* Update dependencies from https://github.com/dotnet/arcade build 20230901.1

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 8.0.0-beta.23451.1

* Update dependencies from https://github.com/dotnet/arcade build 20230913.1

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 8.0.0-beta.23463.1

* Update dependencies from https://github.com/dotnet/arcade build 20230913.1

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 8.0.0-beta.23463.1

* Update dependencies from https://github.com/dotnet/arcade build 20230913.1

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 8.0.0-beta.23463.1

* Update dependencies from https://github.com/dotnet/arcade build 20231008.1

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 9.0.0-beta.23508.1

* Update dependencies from https://github.com/dotnet/arcade build 20231010.4

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 9.0.0-beta.23510.4

* Update dependencies from https://github.com/dotnet/arcade build 20231018.2

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 9.0.0-beta.23518.2

* Update dependencies from https://github.com/dotnet/arcade build 20231028.2

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XliffTasks , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 9.0.0-beta.23528.2

* Update dependencies from https://github.com/dotnet/arcade build 20231103.1

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XliffTasks , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 9.0.0-beta.23553.1

* Update dependencies from https://github.com/dotnet/arcade build 20231110.1

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XliffTasks , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 9.0.0-beta.23560.1

* Update dependencies from https://github.com/dotnet/arcade build 20231117.1

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XliffTasks , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 9.0.0-beta.23567.1

* Fixed version update breaks.

* Update dependencies from https://github.com/dotnet/arcade build 20231122.2

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XliffTasks , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 9.0.0-beta.23572.2

* Update dependencies from https://github.com/dotnet/arcade build 20231201.1

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XliffTasks , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 9.0.0-beta.23601.1

* Update dependencies from https://github.com/dotnet/arcade build 20231207.2

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XliffTasks , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 9.0.0-beta.23607.2

* Update dependencies from https://github.com/dotnet/arcade build 20231215.2

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XliffTasks , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 9.0.0-beta.23615.2

* Update XUnitVersion

* Update MicrosoftMLOnnxRuntimeVersion to 1.16.3

* Rollback OnnxRuntime and suppress warning

* Update to Xunit with fix for https://github.com/xunit/xunit/issues/2821

* Ensure we pull down 8.0 runtime.

* Update Centos docker containers

* Fix packaging step

* Try including stdint.h to fix missing uint8_t on centos

* Update Centos test queue

* Attempt to use runtime centos-stream8-helix container for tests

* Use centos-stream8-mlnet-helix container for testing

* Undo changes to test data

* Make NETFRAMEWORK ifdef versionless

* Switch back to centos7 for testing

* Revert "Switch back to centos7 for testing"

This reverts commit ab0d41e4b7.

* Update dependencies from https://github.com/dotnet/arcade build 20231221.2

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XliffTasks , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 9.0.0-beta.23621.2

* Update dependencies from https://github.com/dotnet/arcade build 20231228.1

Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild , Microsoft.DotNet.XliffTasks , Microsoft.DotNet.XUnitExtensions
 From Version 8.0.0-beta.23265.1 -> To Version 9.0.0-beta.23628.1

---------

Co-authored-by: dotnet-maestro[bot] <dotnet-maestro[bot]@users.noreply.github.com>
Co-authored-by: Eric StJohn <ericstj@microsoft.com>
Co-authored-by: Michael Sharp <misharp@microsoft.com>
2024-01-02 15:46:00 -08:00
Eric StJohn d3c31274ce
Make double assertions compare with tolerance instead of precision (#6923)
Precision might cause small differences to round to a different number.
Instead compare with a tolerance which is not sensitive to rounding.
2023-12-22 12:18:29 -08:00
Eric StJohn 2093331055
Fix assert by only accessing idx (#6924)
Asserting on `_rowCount <  Utils.Size(_valueBoundaries)` was catching a
case where `_rowCount`'s update was reordered before `_valueBoundaries`

This was unnecessary, since this method doesn't need to use `_rowCount`.

Instead, make the asserts use only `idx` which will be maintained
consistent with the waiter logic in this cache.
Ensure we only ever use `_rowCount` from the caching thread, so write
reordering won't matter.
2023-12-22 12:12:12 -08:00
Eric StJohn a60be5f215
Rename NameEntity to NamedEntity (#6917) 2023-12-21 10:08:50 -08:00
Eric StJohn b8f71b9c66
Don't include the SDK in our helix payload (#6918)
* Don't include the SDK in our helix payload

I noticed that the tests included the latest SDK - including the host -
in our helix payloads.

This is a large amount of unnecessary downloads and it also makes it so
we use the latest host on the older frameworks which can fail when the
latest host drops support for distros.

Since our tests shouldn't need the full CLI, remove this from our helix
payloads.

We'll instead get just the runtime we need through `AdditionalDotNetPackage`

* Place Helix downloaded runtime on the PATH

Helix only sets the path when the CLI is included, however we don't
need the CLI.
2023-12-21 08:37:50 -08:00
Michael Sharp 5483ba93c5
updated ml.net versioning (#6907) 2023-12-12 12:03:30 -08:00
Aleksei Smirnov efab0114ed
Reorganize dataframe files (#6872)
* Increase performance of elementwise comparison operations

* Fix Perf Test

* Reorganize files in the DataFrame related projects

* Fix merge issues
2023-12-12 10:04:47 -07:00
The Algorithms Guy eb9af180a2
Issue #6606 - Add sample variance and standard deviation to NormalizeMeanVariance (#6885)
* Add sample standard deviation flag

* Update NormalizeColumnDbl.cs

* Update NormalizeColumnSng.cs

---------

Co-authored-by: emeikleham <emeikleham@exactimaging.com>
Co-authored-by: Michael Sharp <51342856+michaelgsharp@users.noreply.github.com>
2023-12-11 15:57:23 -07:00
JJ 9d92597fe8
Update developer-guide.md (#6870)
Fixed a couple of spelling errors
2023-12-07 13:31:55 -07:00
Aleksei Smirnov f32ae2c687
Add support for Apache.Arrow.Types.TimestampType to DataFrame (#6871) 2023-12-07 13:27:01 -07:00
taeerhebend 7366b995d4
Update release-3.0.0.md (#6895) 2023-12-07 13:15:04 -07:00
Jeff Handley 4223c2720e
Migrate to the 'locker' GitHub action for locking closed/stale issues/PRs (#6896) 2023-11-30 17:14:50 -08:00
Michael Sharp d96d7b741e
Release notes for 3.0 (#6888)
* release notes 3.0

* Apply suggestions from code review

Co-authored-by: Jeff Handley <jeffhandley@users.noreply.github.com>

* updates from pr

---------

Co-authored-by: Jeff Handley <jeffhandley@users.noreply.github.com>
2023-11-22 12:35:14 -07:00
Michael Sharp db08da61b8
added in win-arm64 (#6813)
* added in win-arm64

* fixed MKL arm64 cmake issue

* right helix queue

* added in win-arm64

* fixed MKL arm64 cmake issue

* right helix queue

* fixing arm tests

* makes x64 test detection better

* change test label

* fixed onnx files not being included

* added new win-arm baselines

* baseline changes

* fixed build issue

* fixed test

* one more basleine

* fixed pack for mkl redist

* .NET update
2023-11-20 22:05:11 -07:00
Michael Sharp aeb1ab8f6b
Updates LightGBM from 2.X.X to 3.X.X (#6880)
* updated lightGBM version

* finished LightGBM baseline update

* removed accidental enums

* Fixed LightGBM Test

* fixed lightgbm

* fixed version

* fixed test with LightGBM

* fixed test result
2023-11-16 16:42:38 -07:00
Michael Sharp d2cf997d90
Changes some of the CPU Math implemenation from our current version to use the new TensorPrimitives package. (#6875)
* using tensor primitives

* added missing files

* some with indexes changed

* Initial swap for TensorPrimitives done

* Rebased and cleaned code

* more minor cleanup

* added system.numerics.tensors version to props

* build fixes

* added net6 again

* updates from PR comments

* fixed sumabsu

* fixed baseline tests

* test fixes

* fixed test failure for kmeans

* changed decimal comparison

* updated more baselines

* Test fixes.

* template update

* Test Fixes.

* fixed performance test csproj

* added baselines for linux arm/64

* fixed linux arm baselines

* fixed arm baselines

* removed extra files

* arm32 baselines updated

* fixed arm baselines
2023-11-14 22:46:15 -07:00
Jeff Handley d8ad1e65b5
FabricBot: Remove area pod project board automation (#6881) 2023-11-14 15:10:26 -07:00
Aleksei Smirnov 796cb354fb
Improve performance of DataFrame binary comparison operations (#6869)
* Increase performance of elementwise comparison operations

* Fix Perf Test

* Fix code review findings
2023-10-22 22:23:12 -07:00
Akash Kundu a3d3813511
Update DataViewRowCursor.md (#6855)
* Update DataViewRowCursor.md

fixed some errors and typos.

* Update docs/code/DataViewRowCursor.md

---------

Co-authored-by: Eric StJohn <ericstj@microsoft.com>
2023-10-19 14:52:58 -07:00
Aleksei Smirnov e82575021e
Avoid Boxing/Unboxing on accessing elements of VBufferDataFrameColumn (fix merge issues) (#6867)
* Avoid Boxing/Unboxing on accessing elements of VBufferDataFrameColumn

* Avoid boxing for vbuffer column
2023-10-17 13:31:50 -07:00
Aleksei Smirnov 766569b86a
Avoid Boxing/Unboxing on accessing elements of VBufferDataFrameColumn (#6865) 2023-10-16 16:32:43 -07:00