antares

Граф коммитов

Автор	SHA1	Сообщение	Дата
ghostplant	0246d68184	update compute compate list	2023-08-04 06:13:08 +08:00
ghostplant	d592a86c6b	add custom defines & atomic macro	2023-07-01 15:32:15 +08:00
ghostplant	3abbbf2358	deal with persistent device context	2023-06-13 23:40:28 +08:00
ghostplant	30308c6db7	aligning memsize for GPU memalloc	2023-04-07 23:41:23 +08:00
ghostplant	3df7c2f0d2	add -ldl to cuda	2023-04-07 20:15:39 +08:00
ghostplant	5b8ec8b63f	make moduleLoad compatible for src & bin	2023-02-24 19:05:42 +08:00
Wei CUI	cf369c9fcf	update ROCm dockerfile to 5.3	2022-11-04 02:00:34 +08:00
Wei CUI	ea21104493	compile gfx code by source hint	2022-10-13 15:14:18 +08:00
ghostplant	44679fab4c	fix corner cases	2022-09-07 23:05:56 +08:00
Wei CUI	94143f6b5e	support CBV for hlsl_win64 & hlsl_xbox	2022-08-18 16:37:01 +08:00
ghostplant	4ecbcf80f8	handle cumemalloc size of zero	2022-08-16 14:46:53 +08:00
ghostplant	a085e945d1	allow compilation for CUDA_ALL_ARCH	2022-07-31 00:03:45 +08:00
ghostplant	d484dcddeb	update args parsing of booting threads	2022-07-23 14:50:12 +08:00
Wei CUI	4545fa31d2	upgrade hlsl runtime to v0.3	2022-07-08 19:00:00 +08:00
ghostplant	9658b1cca3	refine execute_module.hpp	2022-07-05 21:50:46 +08:00
ghostplant	dde0726412	update clang-format option & compile time	2022-05-08 18:17:55 +08:00
Wei CUI	82c5bd8dd3	split moduleCompile and moduleLoad	2022-04-25 14:29:15 +08:00
ghostplant	b04370b31d	update vamap_args	2022-03-25 00:53:44 +08:00
Wei CUI	7d7566d330	fix bug when single tensor memsize >= 4GB	2022-03-22 00:06:50 +08:00
Wei CUI	95c832d5ee	setup codegen flags for vamap	2022-03-18 21:20:32 +08:00
Wei CUI	8d37c4c093	enhance runtime checking of python version & cuda	2021-08-23 15:18:47 +08:00
ghostplant	06a6bfae57	support mcpu tuning in pytorch JIT plugin (#255 )	2021-04-29 22:30:11 +08:00
ghostplant	356ca4a474	Backend stream (#242 ) * include stream in backend interfaces * synchronize stream for win64 backends	2021-04-18 15:53:22 +08:00
ghostplant	1100a3d806	move tempfile/syscall to ab_utils (#188 )	2021-03-02 16:47:47 +08:00
ghostplant	7737b7e6db	update tf/pytorch plugin to support graph execution (#180 )	2021-02-26 23:18:42 +08:00
ghostplant	505698739f	add missing Dockerfiles for win64 backends (#179 )	2021-02-26 14:15:05 +08:00
ghostplant	68e6d04ccb	rename c-hlsl to c-hlsl_win64 (#176 ) * rename c-hlsl to c-hlsl_win64	2021-02-26 10:10:26 +08:00
ghostplant	5fff4f2bc9	add foreground timeout monitoring (#174 )	2021-02-25 12:34:48 +08:00
ghostplant	af453be434	Upgrade Next Format (#170 ) * upgrade next format	2021-02-23 23:28:39 +08:00
ghostplant	8307b33e85	Upgrade Evaluate Format (#165 ) * upgrade exxcution format * proper refactor * split backend interfaces to headers * part refactor: cancel prebuild; trans src v2; incomplete backends * moving eval_client to graph_evaluator * refactor backend C interface * init C backend implementation for c-mcpu * finish upgrade for scpu/mcpu/sycl/cuda/rocm * finish upgrade for c-ocl * finish partial upgrade for c-gc * finish partial upgrade for c-hlsl * add codehub folder * remove frameworks/antares * synchronize JIT plugin parser	2021-02-21 14:15:27 +08:00

30 Коммитов