Commit graph

486 commits

Author SHA1 Message Date
ghostplant 0a6df04423 fix parsing code name for MI200/300 2023-09-05 15:29:49 +08:00
ghostplant 9fc5dcec3f fallback using gcc atomic 2023-08-29 07:04:56 +00:00
ghostplant daaa0204d7 update rocm dockerfiles 2023-08-10 18:33:32 +08:00
ghostplant 86317b0350 update container env for torch2 2023-08-09 17:02:29 +08:00
ghostplant 0246d68184 update compute compate list 2023-08-04 06:13:08 +08:00
ghostplant 2215a0f53e add get_hash as builtin operation 2023-07-23 13:25:19 +00:00
Ziming Miao 1e76ea3152 support msvc build (#369) 2023-07-17 22:14:57 +08:00
ghostplant e576e86ec3 input calibration in numpy format 2023-07-17 15:30:50 +08:00
ghostplant f50649c15f update GPU policy for non GPU devices 2023-07-17 12:20:35 +08:00
ghostplant 4ab9d09d59 update sycl shared memory parsing 2023-07-13 17:53:02 +08:00
ghostplant d9adfbb604 add more atomic defines 2023-07-06 23:54:22 +08:00
ghostplant 6936e1b8bf add compile support for xbox 2023-07-06 01:23:44 +08:00
ghostplant 1f0e700e33 parse CODE_DEBUG option when saving object 2023-07-03 10:38:05 +00:00
ghostplant d592a86c6b add custom defines & atomic macro 2023-07-01 15:32:15 +08:00
ghostplant 28218cbbe5 handle codegen method for __builtin_cond 2023-06-18 16:40:18 +08:00
ghostplant 142b419f64 parse LDS defined from subfunction 2023-06-17 15:10:41 +08:00
ghostplant 64b4644370 add namespace to call init function 2023-06-16 13:51:09 +08:00
Wei CUI 4100774b73 use uint as boolean buffer type 2023-06-14 17:43:04 +08:00
ghostplant 3abbbf2358 deal with persistent device context 2023-06-13 23:40:28 +08:00
ghostplant c3762eaced using newline before appending codes 2023-05-08 13:32:01 +08:00
ghostplant 5df9a1d2c5 avoid reconnecting socket due to multi-init 2023-04-19 00:00:27 +08:00
Wei CUI 458acf2b39 add d2d entry of HLSL backend 2023-04-18 13:07:27 +08:00
Wei CUI edfec644b7 add custom renaming of kernel function 2023-04-15 23:52:06 +08:00
ghostplant 7880d49af2 support uploading file for filemgr.py 2023-04-11 03:31:31 +08:00
ghostplant 6880cab5ca add filemgr.py 2023-04-09 08:29:45 +08:00
Wei CUI e3650e6bd9 update binder to support upload 2023-04-09 08:23:45 +08:00
ghostplant 30308c6db7 aligning memsize for GPU memalloc 2023-04-07 23:41:23 +08:00
ghostplant 3df7c2f0d2 add -ldl to cuda 2023-04-07 20:15:39 +08:00
ghostplant 60aa00bc48 fix replacement of scatter IR 2023-04-02 15:48:58 +08:00
ghostplant 65a50439d2 fp16 x int32 -> fp16 2023-03-31 15:08:38 +08:00
Wei CUI fe3f3e39ba allow xbox client to work in standalone mode 2023-03-29 20:59:43 +08:00
ghostplant c95ca32481 refactor for refresh codegen 2023-03-06 15:43:22 +08:00
ghostplant 450b3c2f56 define uint32_t to avoid undefined types 2023-02-26 13:58:11 +00:00
ghostplant 5b8ec8b63f make moduleLoad compatible for src & bin 2023-02-24 19:05:42 +08:00
ghostplant b3d4d3e182 set flags for dependency checking 2023-02-17 23:46:13 +08:00
ghostplant 1a102320bf fix template typename sycl atomics 2023-02-13 00:16:30 +08:00
ghostplant 8d761ae973 less strict condition dtype 2023-02-10 23:08:47 +08:00
Wei CUI 6f7a1a73dc handle the difference of memory management 2023-02-10 13:38:08 +08:00
Wei CUI 39b0ccf50e deal with flags for remote backend 2023-02-10 00:20:15 +08:00
ghostplant 7f809d4e12 update kernel parsing 2023-02-09 23:15:54 +08:00
ghostplant af839a1dfc add binder script 2023-02-09 22:45:17 +08:00
ghostplant 5232f3b303 fix hlsl atomic 2023-02-09 21:21:53 +08:00
ghostplant 0bad75f7a2 fix aggregation max/min 2023-02-09 15:11:13 +08:00
ghostplant d3dce9220c synchronize common atomic funcs 2023-02-09 00:28:01 +08:00
ghostplant ca07421e4e sync to sycl-2023 2023-02-07 00:08:19 +08:00
Wei CUI 433f4c7c3e add flags to distinguish xbox model 2023-02-01 23:06:19 +08:00
Wei CUI 30ed0dd261 apply ATOMIC_ADD symbol 2023-01-31 20:32:49 +08:00
Wei CUI b347b3a562 upgrade antares_hlsl.dll to v0.3.4 2023-01-31 15:55:10 +08:00
Wei CUI 43938402b8 dxwarpper support fatbin compilation 2023-01-31 15:38:20 +08:00
Wei CUI 9361419404 add template flags / recommend mode + compat 2023-01-30 17:54:57 +08:00