mozilla/aom - aom

Граф коммитов

Автор	SHA1	Сообщение	Дата
Yi Luo	dd2064a0ac	Merge "Fix bugs in convolution filter optimization" into nextgenv2	2016-06-27 21:33:45 +00:00
Yi Luo	8404253f81	Fix bugs in convolution filter optimization - Fix the over-writing bug in horizontal filtering as width = 2. - Fix 10-tap vertical filtering which no longer reads one row of pixel above the block. - Fix 10-tap filter zero padding. - Encoder speed slow down ~4.0%, compared to, `81ad953` Convolution vertical filter SSSE3 optimization Change-Id: I9bb294a4529300081c29bf284e6bc6eb081cc536	2016-06-27 10:23:38 -07:00
Debargha Mukherjee	9f2167aede	Merge "Turn on ActiveMapRefreshTest for Vp10" into nextgenv2	2016-06-25 00:32:21 +00:00
Debargha Mukherjee	cf0cdfc55e	Turn on ActiveMapRefreshTest for Vp10 Also reduce number of frames coded for VP10. Change-Id: I7de908861620b6f4f08513516110fd584660d994	2016-06-24 12:55:03 -07:00
Yi Luo	2003cd8011	Merge "Change register loading to fix stack overflow issue" into nextgenv2	2016-06-24 18:47:21 +00:00
Yi Luo	08184e32de	Change register loading to fix stack overflow issue - Use _mm_loadl_epi64 instead of _mm_loadu_si128 for uint16_t temp2[4 * 4] buffer. - Refer to: `d0de89a` remove vpx_highbd_1[02]_sub_pixel_variance4x4_sse4_1 BUG=webm:1242 Change-Id: Ieff555c8dd8070937f27f4ec8535b77e1ed5b8b2	2016-06-24 10:39:49 -07:00
Yi Luo	81ad95363a	Convolution vertical filter SSSE3 optimization - Apply 8-pixel vertical filtering direction parallelism. - Add unit tests to verify bit exact. - Encoder speed improves ~29% (enable EXT_INTERP) on Xeon E5-2680. - Combinational cycle count of vp10_convolve() drops from 26.06% to 6.73%. Change-Id: Ic1ae48f8fb1909991577947a8c00d07832737e57	2016-06-23 12:56:47 -07:00
Yi Luo	f26a48bd52	Fix input buffer initialization in convolution filter test Change-Id: I70c0da96a81463d752e88b134b6fde012bd5823d	2016-06-22 11:46:16 -07:00
James Zern	5d14586392	Merge "remove vpx_highbd_1[02]_sub_pixel_variance4x4_sse4_1" into nextgenv2	2016-06-22 03:13:31 +00:00
Geza Lore	7de2ba3eae	Fix false uninitialized warnings (GCC 5+). Change-Id: Ia00c754ddaf22bb7f1dfcd20106db6293bf4b070	2016-06-21 12:54:17 +01:00
Yi Luo	f1a50db2d1	Merge "Convolution horizontal filter SSSE3 optimization" into nextgenv2	2016-06-20 20:06:02 +00:00
Yi Luo	229690a95c	Convolution horizontal filter SSSE3 optimization - Apply signal direction/4-pixel vertical/8-pixel vertical parallelism. - Add unit test to verify the bit exact result. - Overall encoding time improves ~24% on Xeon E5-2680 CPU. Change-Id: I104dcbfd43451476fee1f94cd16ca5f965878e59	2016-06-20 11:10:30 -07:00
Debargha Mukherjee	dc5431ad4b	Merge "Turn on AqSegment tests for VP10" into nextgenv2	2016-06-20 16:47:13 +00:00
James Zern	d0de89a12a	remove vpx_highbd_1[02]_sub_pixel_variance4x4_sse4_1 these cause ASan errors VP10/EndToEndTestLarge.EndtoEndPSNRTest BUG=webm:1242 Change-Id: I0334e3b255b14e18f61970c3721ae748dc79727b	2016-06-17 19:46:20 -07:00
Geza Lore	7172e97abe	Re-enable ActiveMapTest for VP10 Change-Id: I030fdde966b9911712eca131d095015afd9b0d8a	2016-06-17 20:33:58 +01:00
Zoe Liu	5201280f70	Disable the unit test of ArfFreq for BIDIR_PRED The test in arf_freq assumes any no-show frame as ALTREF_FRAME and then calculate the minimum run between two consecutive ALTREF_FRAME's based on this assumption. As BWDREF_FRAME is also a no-show frame and the minimum run between two consecutive BWDREF_FRAME's may vary between 1 and any arbitrary positive number as long as it does not exceed the golden frame group interval, this test does not apply to the experiment of BIDIR_PRED. Change-Id: I70efb2c691fdc18601dbb8a7735ac2f27817e75a	2016-06-16 09:45:57 -07:00
Zoe Liu	a0d122079d	Merge "Fix the superframe unit test for BIDIR_PRED" into nextgenv2	2016-06-16 16:15:07 +00:00
Debargha Mukherjee	567ee69b24	Turn on AqSegment tests for VP10 Also shortens the test and changes some of the parameters. Change-Id: Ieda4aeffa55550fbb9e4235f735c383ef6baf32c	2016-06-16 07:26:39 -07:00
Debargha Mukherjee	f9fc898d56	Merge "Split some slower tests based on cpu-used" into nextgenv2	2016-06-16 11:46:36 +00:00
Debargha Mukherjee	6abddf37f8	Split some slower tests based on cpu-used Change-Id: Idf84475fe06666d5c73c9d86dfc5c23bef170086	2016-06-15 23:14:51 -07:00
James Zern	94e84bbc07	cosmetics,test.mk: fix a typo Change-Id: Ib74a494e1cf50a356f51e8185e19ca66fcb896a2	2016-06-15 20:33:04 -07:00
James Zern	fba6f748e8	rename vp9_end_to_end_test.cc -> end_to_end_test.cc this is shared between vp9/10 BUG=webm:1235 Change-Id: I2f44b15268a33453a1c1e0c691d4fc1fc12d0263	2016-06-15 18:30:22 -07:00
James Zern	2710f76692	vp9_end_to_end_test: enable in vp10-only builds this file is shared between vp9 & vp10; this makes it available in the presence of --disable-vp9 BUG=webm:1235 Change-Id: Iaf060c3c09afd2c7df69995b0c01589f78d4945e	2016-06-15 18:28:30 -07:00
Zoe Liu	1aa674b588	Fix the superframe unit test for BIDIR_PRED Change-Id: I2ef8e479893403581711abc020509c6863c2035d	2016-06-15 17:18:26 -07:00
Sarah Parker	50c5921517	Add EndToEndTestLarge for VP10 non-highbitdepth The current test case is only run for vp9 and vp10 when HBD is enabled. This was mistakenly removed in: `d53f9a3` Enable VP10 HBD PSNR checking unit test Change-Id: I88b8168ad1efd805d759238a037653a2901bf50d	2016-06-15 19:45:24 +00:00
James Zern	05bd964adc	Merge "Revert "Add 1D version of vpx_sum_squares_i16"" into nextgenv2	2016-06-14 00:04:57 +00:00
James Zern	a8ba2eb3d3	active_map_refresh_test: fix missing file w/vp10-only Change-Id: I6413b7622a3c8524ec0409e087cf7c92f79e4f2d	2016-06-11 09:49:02 -07:00
Alex Converse	11ce75968f	Merge "Turn on ActiveMapTest speeds [0,5) with all experiments." into nextgenv2	2016-06-10 21:52:57 +00:00
James Zern	5e831c548f	Revert "Add 1D version of vpx_sum_squares_i16" This reverts commit `f19700fe52`. This crashes in SSE2/SumSquares2DTest.RandomValues/0 under x86 due to alignment issues Change-Id: I135d83ba6a7894c09d7c7a139b7eaf876416b40c	2016-06-09 23:42:15 -07:00
James Zern	667db87a1b	Merge "Revert "Optimize wedge partition selection."" into nextgenv2	2016-06-10 03:49:29 +00:00
Angie Chiang	95340fccb3	Revert "Optimize wedge partition selection." This reverts commit `efda2831e5`. This commit causes segmentation fault at SSE2/SumSquares2DTest.RandomValues/0 Change-Id: I171937e4daf6f15323e8206418773deb03bd8c53	2016-06-09 19:17:37 -07:00
Sarah Parker	9d924a0c4a	Fix vp9_end_to_end_test for vp10 HBD This test is failing when no experiments are turned on. PSNR is 31.96 when the threshold is 32. broken since: `0d6980d` Remove swap buffer speed feature Change-Id: I3c29815b40d5282c37f52f4345b56992f8558b2e	2016-06-09 18:47:47 -07:00
Alex Converse	587b8a11d0	Turn on ActiveMapTest speeds [0,5) with all experiments. Change-Id: I7da9e6a85648aa69e5e20d825b717d51e3c6809c	2016-06-09 13:51:00 -07:00
Alex Converse	d279cadbe0	Port active map / cyclic refresh fixes to VP10. Bring commits `575e81f` and `3d6b8a6` to VP10. These changes predate the creation of the active map cyclic refresh test. BUG=https://bugs.chromium.org/p/webm/issues/detail?id=1224 Change-Id: I3559b6933ffa5649926a4b214e45ed0fae523a25	2016-06-09 16:52:43 +00:00
Angie Chiang	d9410d2d43	Merge "Move #if out of TEST_P in vp10_fwd/inv_txfm2d_test.cc" into nextgenv2	2016-06-07 22:02:28 +00:00
Alex Converse	7e26f01342	Turn ActiveMapTest back on. If it's creating problems with some experiments, disable it under the actual conditions where it doesn't work and file a bug. Change-Id: Iab9f4bfe42ea926d49d371918da25f9a8938a20f	2016-06-07 11:59:15 -07:00
Debargha Mukherjee	13155e7725	Merge "Optimize wedge partition selection." into nextgenv2	2016-06-07 09:50:13 +00:00
Debargha Mukherjee	24a04f9048	Merge "Fix decoder crash with supertx" into nextgenv2	2016-06-07 09:46:48 +00:00
Angie Chiang	f67196b2ed	Move #if out of TEST_P in vp10_fwd/inv_txfm2d_test.cc Change-Id: I1d5b2408f27a1e277574c2238f1e49e884596309	2016-06-06 12:45:54 -07:00
Geza Lore	efda2831e5	Optimize wedge partition selection. We can optimize wedge partition selection by pre-computing the residuals of the 2 underlying predictors, and then blend these to compute the sse of the compound predictor, without actually having to compute and subtract the compound predictor. Similarly we can pre-compute a proxy array which we can use to cheaply check which mask sign would have lower sse. Details are in wedge_utils.c. Mathematically these are equivalence transformations, but due to the finite precision the encoder output will be perturbed, though on average this should make 0% difference. ext-inter gains about ~4.5% speedup. Change-Id: Ib2657c3209ae161b4090b58b4b6c392641bf2792	2016-06-06 14:43:10 +01:00
Geza Lore	6c4306c27d	Fix decoder crash with supertx xd->plane[0].n4_h and xd->plane[0].n4_w are not set at that point when using supertx. While this fixes the immediate crash described in the referenced bug report, there are still issues in the ref-mv experiment that causes these tests to fail, so they are kept disabled. BUG=https://bugs.chromium.org/p/webm/issues/detail?id=1230 Change-Id: Ibf8ef02847a903f8d10e6be28e16694db10c75af	2016-06-06 09:58:11 +01:00
Geza Lore	f19700fe52	Add 1D version of vpx_sum_squares_i16 Change-Id: I0d7bda2fe6f995a9e88a9f66540b4979b3f7fab1	2016-06-03 09:34:55 +01:00
Geza Lore	5a69ee0e11	Move template specializations into .cc from .h Change-Id: I6d8775c1fa228fde25016a401e3c22a8e3da42f9	2016-06-03 09:34:55 +01:00
Alex Converse	380c4ee32d	Merge "segmentation: Don't use uninitialized probability data." into nextgenv2	2016-06-01 17:50:37 +00:00
Alex Converse	7a6cb59dbb	segmentation: Don't use uninitialized probability data. BUG=https://bugs.chromium.org/p/webm/issues/detail?id=1224 Change-Id: I17b76fcf0d8c191850350d5aa50dcc007b8b0cdc	2016-05-31 16:42:29 -07:00
James Zern	5d237f0986	vp10_inv_txfm2d_test: fix memory leak input_, ref_input_ and output_ were being allocated with new[] followed by vpx_memalign, remove the former Change-Id: Ia16d0f9b9317042a24445095ad3c284f4e7bb481	2016-05-26 20:04:59 -07:00
Yi Luo	469d002f4e	Merge "Integrate HBD inverse HT flip types sse4.1 optimization" into nextgenv2	2016-05-25 21:35:14 +00:00
Yi Luo	bfe4c0ae07	Integrate HBD inverse HT flip types sse4.1 optimization - tx_size: 4x4, 8x8, 16x16. - tx_type: FLIPADST_DCT, DCT_FLIPADST, FLIPADST_FLIPADST, ADST_FLIPADST, FLIPADST_ADST. - Encoder speed improvement: park_joy_1080p_12: ~11%, crowd_run_1080p_12: ~7%. - Add unit test cases for bit-exact against C. Change-Id: Ia69d069031fa76c4625e845bfbfe7e6f6ed6e841	2016-05-25 12:32:10 -07:00
James Zern	008f27e70a	Merge "add vp10 ActiveMap/ActiveMapRefreshTest" into nextgenv2	2016-05-25 19:05:02 +00:00
Yi Luo	28cdee448d	HBD inverse HT 8x8 and 16x16 sse4.1 optimization - Covers tx_type: DCT_DCT, DCT_ADST, ADST_DCT, ADST_ADST. - Encoding speed improves ~27% on crowd_run_1080p_12. - Merge 4x4, 8x8, 16x16 unit tests in one test file. Change-Id: I058ef5254d068a9523a826480c78ebbdd231824c	2016-05-24 12:55:30 -07:00

1 2 3 4 5 ...

1668 Коммитов