mozilla/aom - aom

Граф коммитов

Автор	SHA1	Сообщение	Дата
Yaowu Xu	fe291b647e	Replace variants of 'vp8' and 'vp9' with 'vpx' Change-Id: Id6cb96b0b15efdda63348d8bfe59fc0533c85ba1	2016-08-04 22:20:38 +00:00
Yaowu Xu	a3cff08259	more cleanup of vp8 and vp9 Change-Id: Ic90ebe6136f4b75645ba699d49c0bcb3764ddccf	2016-08-03 12:20:33 -07:00
Yaowu Xu	d4c4724090	Cherry pick renaming changes from AOMedia branch Manually cherry-picked the following changes: `8c8d16de` vp9 -> vpx in names `75b57d39` VP9_ -> VPX_ in function names `761a7088` VP9_INTERP_EXTEND -> VPX_INTERP_EXTEND `4273a52c` VP9->VPX in border pixel macros `03568c31` VP9_FRAME_MARKER -> VPX_FRAME_MARKER `2334f51d` VP9->VPX in fdct function names Change-Id: Icc18dbf4b416dd0fa21033b3e19ab8a47c893508	2016-07-29 13:31:32 -07:00
Yaowu Xu	3fa28d51af	More vp8/vp9 clean up Change-Id: I8101de20e873c19d03c7fd2977bc22003e395807	2016-07-28 18:22:47 -07:00
Yaowu Xu	3bd709fafe	Remove vp8, vp9 folders Change-Id: I09b8acd22d031ece52e1fee18b998349bf1cf06b	2016-07-28 14:33:21 +00:00
Ivan Krasin	eb66904d7c	Fix compilation error under Clang 4.0. The LLVM trunk has reached 4.0 and now __clang_major__ is not enough to distinguish between old XCode Clang and the new 'real' Clang. Using __apple_build_version__ allows to make this distinction. BUG=chromium:631144 Change-Id: I0b6e46fddfe4f409c7b7e558bda34872e60ee2d9	2016-07-26 04:12:08 +00:00
Yury Gitman	a8de3c0c89	Add VPX_SWAP macro Change-Id: I60e233eddef238ad918183392794084673f27d2d	2016-07-26 04:08:06 +00:00
Yaowu Xu	53fb766d2a	inline->INLINE: vpx_dsp/x86/blend_a64_vmask_sse4.c This fixes the build using MSVC. Change-Id: I7788e28fd4f0c6ff3d405c4b4a2ff59eda3ba8b6	2016-07-25 10:43:12 -07:00
Yaowu Xu	44aac61c13	Add and remove explicit type conversions Change-Id: I8b791fda7c64a0363549add99dc9fcae3b29beae	2016-07-22 10:04:06 -07:00
Johann	2967bf355e	Merge changes from libvpx/master by cherry-pick This commit bring all up-to-date changes from master that are applicable to nextgenv2. Due to the remove VP10 code in master, we had to cherry pick the following commits to get those changes: Add default flags for arm64/armv8 builds Allows building simple targets with sane default flags. For example, using the Android arm64 toolchain from the NDK: https://developer.android.com/ndk/guides/standalone_toolchain.html ./build/tools/make-standalone-toolchain.sh --arch=arm64 \ --platform=android-24 --install-dir=/tmp/arm64 CROSS=/tmp/arm64/bin/aarch64-linux-android- \ ~/libvpx/configure --target=arm64-linux-gcc --disable-multithread BUG=webm:1143 vpx_lpf_horizontal_4_sse2: Remove dead load. Change-Id: I51026c52baa1f0881fcd5b68e1fdf08a2dc0916e Fail early when android target does not include --sdk-path Change-Id: I07e7e63476a2e32e3aae123abdee8b7bbbdc6a8c configure: clean up var style and set_all usage Use quotes whenever possible and {} always for variables. Replace multiple set_all calls with able_feature(). Conflicts: build/make/configure.sh vp9-svc: Remove some unneeded code/comment. datarate_test,DatarateTestLarge: normalize bits type quiets a msvc warning: conversion from 'const int64_t' to 'size_t', possible loss of data mips added p6600 cpu support Removed -funroll-loops psnr.c: use int64_t for sum of differences Since the values can be negative. .asm: normalize label format add a trailing ':', though it's optional with the tools we support, it's more common to use it to mark a label. this also quiets the orphan-labels warning with nasm/yasm. BUG=b/29583530 Prevent negative variance Due to rounding, hbd variance may become negative. This commit put in check and clamp of negative values to 0. configure: remove old visual studio support (<2010) BUG=b/29583530 Conflicts: configure configure: restore vs_version variable inadvertently lost in the final patchset of: 078dff7 configure: remove old visual studio support (<2010) this prevents an empty CONFIG_VS_VERSION and avoids make failure Require x86inc.asm Force enable x86inc.asm when building for x86. Previously there were compatibility issues so a flag was added to simplify disabling this code. The known issues have been resolved and x86inc.asm is the preferred abstraction layer (over x86_abi_support.asm). BUG=b:29583530 convolve_test: fix byte offsets in hbd build CONVERT_TO_BYTEPTR(x) was corrected in: 003a9d2 Port metric computation changes from nextgenv2 to use the more common (x) within the expansion. offsets should occur after converting the pointer to the desired type. + factorized some common expressions Conflicts: test/convolve_test.cc vpx_dsp: remove x86inc.asm distinction BUG=b:29583530 Conflicts: vpx_dsp/vpx_dsp.mk vpx_dsp/vpx_dsp_rtcd_defs.pl vpx_dsp/x86/highbd_variance_sse2.c vpx_dsp/x86/variance_sse2.c test: remove x86inc.asm distinction BUG=b:29583530 Conflicts: test/vp9_subtract_test.cc configure: remove x86inc.asm distinction BUG=b:29583530 Change-Id: I59a1192142e89a6a36b906f65a491a734e603617 Update vpx subpixel 1d filter ssse3 asm Speed test shows the new vertical filters have degradation on Celeron Chromebook. Added "X86_SUBPIX_VFILTER_PREFER_SLOW_CELERON" to control the vertical filters activated code. Now just simply active the code without degradation on Celeron. Later there should be 2 set of vertical filters ssse3 functions, and let jump table to choose based on CPU type. improve vpx_filter_block1d* based on replace paddsw+psrlw to pmulhrsw Make set_reference control API work in VP9 Moved the API patch from NextGenv2. An example was included. To try it, for example, run the following command: $ examples/vpx_cx_set_ref vp9 352 288 in.yuv out.ivf 4 30 Conflicts: examples.mk examples/vpx_cx_set_ref.c test/cx_set_ref.sh vp9/decoder/vp9_decoder.c deblock filter : moved from vp8 code branch The deblocking filters used in vp8 have been moved to vpx_dsp for use by both vp8 and vp9. vpx_thread.[hc]: update webp source reference + drop the blob hash, the updated reference will be updated in the commit message BUG=b/29583578 vpx_thread: use native windows cond var if available BUG=b/29583578 original webp change: commit 110ad5835ecd66995d0e7f66dca1b90dea595f5a Author: James Zern <jzern@google.com> Date: Mon Nov 23 19:49:58 2015 -0800 thread: use native windows cond var if available Vista / Server 2008 and up. no speed difference observed. 100644 blob 4fc372b7bc6980a9ed3618c8cce5b67ed7b0f412 src/utils/thread.c 100644 blob 840831185502d42a3246e4b7ff870121c8064791 src/utils/thread.h vpx_thread: use InitializeCriticalSectionEx if available BUG=b/29583578 original webp change: commit 63fadc9ffacc77d4617526a50c696d21d558a70b Author: James Zern <jzern@google.com> Date: Mon Nov 23 20:38:46 2015 -0800 thread: use InitializeCriticalSectionEx if available Windows Vista / Server 2008 and up 100644 blob f84207d89b3a6bb98bfe8f3fa55cad72dfd061ff src/utils/thread.c 100644 blob 840831185502d42a3246e4b7ff870121c8064791 src/utils/thread.h vpx_thread: use WaitForSingleObjectEx if available BUG=b/29583578 original webp change: commit 0fd0e12bfe83f16ce4f1c038b251ccbc13c62ac2 Author: James Zern <jzern@google.com> Date: Mon Nov 23 20:40:26 2015 -0800 thread: use WaitForSingleObjectEx if available Windows XP and up 100644 blob d58f74e5523dbc985fc531cf5f0833f1e9157cf0 src/utils/thread.c 100644 blob 840831185502d42a3246e4b7ff870121c8064791 src/utils/thread.h vpx_thread: use CreateThread for windows phone BUG=b/29583578 original webp change: commit d2afe974f9d751de144ef09d31255aea13b442c0 Author: James Zern <jzern@google.com> Date: Mon Nov 23 20:41:26 2015 -0800 thread: use CreateThread for windows phone _beginthreadex is unavailable for winrt/uwp Change-Id: Ie7412a568278ac67f0047f1764e2521193d74d4d 100644 blob 93f7622797f05f6acc1126e8296c481d276e4047 src/utils/thread.c 100644 blob 840831185502d42a3246e4b7ff870121c8064791 src/utils/thread.h vp9_postproc.c missing extern. BUG=webm:1256 deblock: missing const on extern const. postproc - move filling of noise buffer to vpx_dsp. Fix encoder crashes for odd size input clean-up vp9_intrapred_test remove tuple and overkill VP9IntraPredBase class. postproc: noise style fixes. gtest-all.cc: quiet an unused variable warning under windows / mingw builds vp9_intrapred_test: follow-up cleanup address few comments from ce050afaf3e288895c3bee4160336e2d2133b6ea Change-Id: I3eece7efa9335f4210303993ef6c1857ad5c29c8	2016-07-18 10:31:10 -07:00
Yaowu Xu	6fe07a207b	Merge branch 'master' into nextgenv2 Change-Id: Ia3c0f2103fd997613d9f16156795028f89f63265	2016-07-14 16:05:48 -07:00
Geza Lore	ebc2d34cd9	Add SSE4.1 vpx_obmc_variance* implementations and cosmetics Speedup for these functions: 4x Also include some cosmetic changes to SAD functions Change-Id: I344c32c795492507ae08742f52d035a13f583799	2016-07-12 21:04:46 -07:00
Geza Lore	c804e0df05	Cleanup obmc_sad function prototypes. Name 'wsrc', 'mask' and 'pre' explicitly, rather than using 'b', 'm' and 'a'. Change-Id: Iaee6d1ac1211b0b05b47cf98b50570089b12d600	2016-07-12 13:23:33 +01:00
James Zern	e046f5efef	highbd_quantize_intrin_sse2.c: add missing rtcd include quiets -Wmissing-prototypes warnings BUG=b/29584271 Change-Id: Iff5214df0d1781810afbfc20bfaf664f109e2f29	2016-07-11 16:52:08 -07:00
Geza Lore	bfa59b4a5f	Improve vpx_blend_* functions. - Made source buffers pointers to const. - Renamed vpx_blend_mask6b to vpx_blend_a64_mask. This is more indicative that the function does alpha blending. The 6, or 6b suffix was misleading, as the max mask value (64) does not fit into 6 bits. - Added VPX_BLEND_* macros to use when needing to blend scalars. - Use VPX_BLEND_A256 in combine_interintra to be more explicit about the operation being done. - Added versions of vpx_blend_a64_* which take 1D horizontal/vertical masks directly and apply them to all rows/columns (vpx_blend_a64_hmask and vpx_blend_a64_vmask). The SSE4.1 optimzied horizontal version now falls back on the 2D version. This can be improved upon if it show up high enough in a profile. - All vpx_blend_a64_* functions now support block sizes down to 1x1 (ie: a single pixel). This is for usage convenience. The SSE4.1 optimized versions fall back on the C implementation if w <= 2 or h <= 2. This can again be improved if it becomes hot code. Change-Id: I13ab3835146ffafe3e1d74d8e9cf64a5abe4144d	2016-07-11 19:05:17 +01:00
Debargha Mukherjee	72ef6d7704	Refactor and clean up on blend_mask6 Change-Id: Ie9188471e7dc07ab9c95b22f258b1662e895c533	2016-07-08 15:02:57 -07:00
Yue Chen	c7a92f2cad	Merge "Add SSE4.1 vpx_obmc_sad* implementations." into nextgenv2	2016-07-07 01:12:20 +00:00
James Zern	abf05c3e60	Merge changes I9433d858,Iafd05637,If08ce6ca into nextgenv2 * changes: tests: remove redundant round() definition remove visual studio < 2010 workarounds configure: remove old visual studio support (<2010)	2016-07-06 23:25:35 +00:00
Geza Lore	aacdf98c9a	Add SSE4.1 vpx_obmc_sad* implementations. Speedup for these functions: 4x Change-Id: I21baa04f53c6ab308ea3edf3ebacc62970e97454	2016-07-06 19:46:13 +00:00
Geza Lore	007aa7dd65	Refactoring in preparation for OBMC optimizations. - Use int32_t instead of int in vpx_obmc{variance,sad} functions - Remove weigthed_src and obmc mask strides and assume contiguous buffers. These inputs can always be packed as contiguous arrays. Change-Id: I74c09b3fb3337f13d39e13a9cb61e140536f345d	2016-07-04 16:57:17 +01:00
James Zern	26f6b216a1	remove visual studio < 2010 workarounds BUG=b/29583530 Change-Id: Iafd05637eb65f4da54a9c857e79204a77646858a (cherry picked from commit c125f4a594815ad63b50e4b684ada4b44c545932)	2016-06-29 17:11:10 -07:00
Yi Luo	08184e32de	Change register loading to fix stack overflow issue - Use _mm_loadl_epi64 instead of _mm_loadu_si128 for uint16_t temp2[4 * 4] buffer. - Refer to: `d0de89a` remove vpx_highbd_1[02]_sub_pixel_variance4x4_sse4_1 BUG=webm:1242 Change-Id: Ieff555c8dd8070937f27f4ec8535b77e1ed5b8b2	2016-06-24 10:39:49 -07:00
Yaowu Xu	ef665996ae	Prevent negative variance Due to rounding used computation, HDB variance computation may produce slightly negative values. This commit adds clamping to make sure output variance values for 10 and 12 to be non-negative. Change-Id: Id679aa55a4c201958c4c7d28cd8733b9246a71c8	2016-06-22 17:55:14 +00:00
James Zern	5d14586392	Merge "remove vpx_highbd_1[02]_sub_pixel_variance4x4_sse4_1" into nextgenv2	2016-06-22 03:13:31 +00:00
Geza Lore	52141c9111	Add 1D version of vpx_sum_squares_i16 Change-Id: I1829f931749a26aec38c896b609c5a2640d6dfaf	2016-06-21 20:31:50 +01:00
Yaowu Xu	543ea3eb3e	Make type conversion explicit This fixes MSVC warnings. Change-Id: I675d8486230b2b74d7973d95720a4995c4750282	2016-06-20 12:05:29 -07:00
James Zern	d0de89a12a	remove vpx_highbd_1[02]_sub_pixel_variance4x4_sse4_1 these cause ASan errors VP10/EndToEndTestLarge.EndtoEndPSNRTest BUG=webm:1242 Change-Id: I0334e3b255b14e18f61970c3721ae748dc79727b	2016-06-17 19:46:20 -07:00
Yaowu Xu	de3a8f23c8	vpx_dsp/quantize.c: fix ubsan warnings BUG=webm:1219 Change-Id: I0c80271c6b78adf40aa7a4cac9e6b431d56958cb	2016-06-16 21:46:14 +00:00
Yaowu Xu	e5e998a6eb	vpx_dsp/variance.c: change to use correct type This commit change to use int64_t to represent the sum of pixel differences, which can be negative. This fixes a number of ubsan warnings. BUG=webm:1219 Change-Id: I885f245ae895ab92ca5f3b9848d37024b07aac98	2016-06-16 21:45:48 +00:00
Geza Lore	ffa9173378	Use correct size load in vpx_avg_4x4_sse2. The old version used 64 bit loads, and then ignored the top half of the result. This can cause asan failures if we read past the end of a buffer. Switched to using 32 bit loads instead. Change-Id: I57da127a26f869fb4b4f700b55408f6dc2fbbc1a	2016-06-16 11:49:24 +01:00
Johann	c516dd67bc	neon hadamard 16x16 Runs about twice as fast as C BUG=webm:1027 Change-Id: I6760d99f4e22259439ca35d746194b12a81bfa71	2016-06-14 19:23:38 +00:00
James Zern	5e831c548f	Revert "Add 1D version of vpx_sum_squares_i16" This reverts commit `f19700fe52`. This crashes in SSE2/SumSquares2DTest.RandomValues/0 under x86 due to alignment issues Change-Id: I135d83ba6a7894c09d7c7a139b7eaf876416b40c	2016-06-09 23:42:15 -07:00
Debargha Mukherjee	697bcef677	Add a couple of missing WRAPLOW checks To make coefficient checking consistent with the VP9 spec sections 8.7.1.6 and 8.7.1.1. Change-Id: I92e38e89a41d1e482317bb478c48ffa608d2d6ee	2016-06-09 12:58:27 -07:00
Debargha Mukherjee	c2ebd0e6da	Merge "Move range checks into WRAPLOW"	2016-06-06 16:28:24 +00:00
James Zern	e34e684059	Merge changes If31d36c8,I10b947e7 * changes: vpx_dsp,add_noise: remove mmx implementation vpx_dsp: remove mmx variance implementations	2016-06-04 00:56:06 +00:00
Debargha Mukherjee	aa90983696	Move range checks into WRAPLOW Provides more comprehensive coverage for --enable-coefficient-checking. The intent is to make the --enable-coefficient-checking option consistent with the VP9 spec. Change-Id: I12d0120756d17572ca2b2d7e6a2ab9d8071d8d58	2016-06-03 11:27:33 -07:00
Linfeng Zhang	b90166665f	Merge "Slow pshufb removal in 3 intra prediction functions."	2016-06-03 16:35:14 +00:00
Geza Lore	f19700fe52	Add 1D version of vpx_sum_squares_i16 Change-Id: I0d7bda2fe6f995a9e88a9f66540b4979b3f7fab1	2016-06-03 09:34:55 +01:00
Geza Lore	9ebca46933	Factor out x86 SIMD intrinsic synonyms Change-Id: Idc4ac3ccd2ba19087cdb74a3e4a6774ac50386aa	2016-06-03 09:34:55 +01:00
James Zern	462e0ff88b	vpx_dsp,add_noise: remove mmx implementation a sse2 version exists, this is a reasonable modern baseline. Change-Id: If31d36c8412d25b53f41b4a93cf02f46802c0c33	2016-06-02 23:51:22 -07:00
James Zern	eea8ea88ab	vpx_dsp: remove mmx variance implementations there are sse2 equivalents for all remaining variance implementations Change-Id: I10b947e73fc0067688181f819b59e47966bec3d2	2016-06-02 23:46:16 -07:00
Linfeng Zhang	ad0646cb84	Slow pshufb removal in 3 intra prediction functions. Replaced vpx_d45_predictor_4x4_ssse3(), vpx_d45_predictor_8x8_ssse3() and vpx_d207_predictor_4x4_ssse3() with created vpx_d45_predictor_4x4_sse2(), vpx_d45_predictor_8x8_sse2() and vpx_d207_predictor_4x4_sse2() respectively. It's mostly neutral or slightly worse than ssse3 in good cases and better than ssse3 in the bad cases (but still worse than using the mmx regs). Change-Id: Ib0237ceb71d2c57b8a93fd3170330cfed9d56bdd	2016-06-02 10:55:58 -07:00
Yaowu Xu	46ff1072b3	variance_avx2.c: UBSAN/IOC fix BUG=https://bugs.chromium.org/p/webm/issues/detail?id=1222 Change-Id: Ifb3bedf9b4e1b007b21aebaa4beb9ba50424efef	2016-05-31 16:44:35 -07:00
Linfeng Zhang	0ba9b299e9	Merge "Upgrade vpx_lpf_{vertical,horizontal}_4 mmx to sse2"	2016-05-27 15:47:28 +00:00
Linfeng Zhang	4b5e462d08	Upgrade vpx_lpf_{vertical,horizontal}_4 mmx to sse2 Followed the code style of other lpf fuctions. These 2 functions put 2 rows of data in a single xmm register, so they have similar but not identical filter operations, and cannot share the same macros. Change-Id: I3bab55a5d1a1232926ac8fd1f03251acc38302bc	2016-05-26 14:55:18 -07:00
Scott LaVarnway	9d24fe60f1	Merge "Code clean of sub_pixel_variance4xh -- 2"	2016-05-26 13:20:24 +00:00
Scott LaVarnway	a4f3751be5	Code clean of sub_pixel_variance4xh -- 2 Replace MMX with SSE2. Change-Id: Id8482d2589131f9427e7f36bc64413f058caf31f	2016-05-24 04:44:05 -07:00
Geza Lore	a661bc87c4	Add optimized vpx_blend_mask6 This is to replace vp10/common/reconinter.c:build_masked_compound. Functionality is equivalent, but the interface is slightly more generic. Total encoder speedup with ext-inter: ~7.5% Change-Id: Iee18b83ae324ffc9c7f7dc16d4b2b06adb4d4305	2016-05-23 16:28:58 +01:00
James Zern	3fb55d24e8	Revert "Code clean of sub_pixel_variance4xh" This reverts commit `2468163e07`. causes valgrind errors for overread of buffer in SubpelVarianceTest Change-Id: I448e52c76f815ac199305b71f7d169f2bc167679	2016-05-19 23:37:27 -07:00
Yaowu Xu	d1f0f4cc63	Merge "Clarify integer value ranges"	2016-05-18 23:55:05 +00:00

1 2 3 4 5 ...

407 Коммитов