mozilla/aom - aom

Граф коммитов

Автор	SHA1	Сообщение	Дата
Dmitry Kovalev	829ec56b47	Merge "Reusing FRAME_COUNTS in the encoder."	2013-12-18 18:27:08 -08:00
Jingning Han	89b6d40690	Replace cpi->common with cm in vp9_onyx_if Replace repeated cpi->common fetching with cm variable in a few places in vp9_onyx_if.c Change-Id: Ifa16d617f37919b2e0baf8efb256130a647b5eb3	2013-12-18 12:31:41 -08:00
Marco Paniconi	02d5ebcfdc	Merge "Updates for 1-pass CBR rate control."	2013-12-18 10:28:33 -08:00
Marco Paniconi	1b8b8b0d0d	Updates for 1-pass CBR rate control. Adjustments based on buffer level, frame dropper. Change-Id: Iaa85b570493526a60c4b9fb7ded4c0226b1b3a33	2013-12-18 09:24:24 -08:00
Yaowu Xu	ede392d765	Merge changes I5d28c2f5,Ib00b036f * changes: Remove redundant function and calls Add test for partial inverse transforms	2013-12-17 16:57:01 -08:00
Yaowu Xu	ed90a176af	Remove redundant function and calls lf deltas are later setup in function vp9_setup_past_independence(), so this commit removed the redundant copy. Also renamed a function to better align the behavior of the funciton. Change-Id: I5d28c2f5b12b3d31817e14296ed4605c1fd5c98c	2013-12-17 15:44:25 -08:00
Dmitry Kovalev	1d23a6594b	Reusing FRAME_COUNTS in the encoder. Change-Id: I6ab9fe2326ebbadf0dd10cca9f66cf8277e3f43b Replacing: comp_inter_count, single_ref_count, comp_ref_count.	2013-12-16 20:12:47 -08:00
Yaowu Xu	36c4e27454	Merge "Move two functions to encoder"	2013-12-16 18:09:51 -08:00
Yaowu Xu	50ec6311e6	Move two functions to encoder As they are used by encoder only. Change-Id: I7b1e6955b218aba66fe156523521a8121c9a84a4	2013-12-16 17:27:48 -08:00
Deb Mukherjee	1e59cbf23b	Rate control changes on active_worst_quality Various cleanups and refactoring. Removes feedback of active worst qaulity and uses last_q instead to make the interface cleaner. Active worst quality is now decided only once for a frame being coded in the beginning based on last_q and other stats. Also, adds other cleaups on last_q to store also the last_q for altref frames, and reduces the altref interval a little. The output does change a little. derfraw300: +0.224% (global psnr) stdhdraw250: +0.442% (global psnr) Change-Id: Ie634cdc032697044c472dd0fe79c109b3e7f9767	2013-12-16 17:08:16 -08:00
Dmitry Kovalev	4f0a381b49	Merge "Reusing nmv_frame_counts from FRAME_COUNTS in encoder."	2013-12-16 14:10:13 -08:00
Frank Galligan	d0ee1fd797	Merge "Add support to pass in external frame buffers."	2013-12-15 19:18:25 -08:00
Frank Galligan	10f891696b	Add support to pass in external frame buffers. VP9 decoder can now use frame buffers passed in by the application. Change-Id: I599527ec85c577f3f5552831d79a693884fafb73	2013-12-15 18:45:46 -08:00
Yunqing Wang	d4b500d9d7	Merge "Increase disable_filter_search_var_thresh threshold"	2013-12-13 15:11:17 -08:00
Yunqing Wang	da9f55c3fb	Increase disable_filter_search_var_thresh threshold Increased threshold(t) for interp filter search. This sped up the encoder with some PSNR loss. Borg tests were ran at speed 2. t = 100, PSNR loss: -0.710%(derf); -0.561%(stdhd); -0.647%(youtube) speedup: 9%(derf); 3%(stdhd); 5.7%(youtube) t = 500, PSNR loss: -1.687%(derf); -1.665%(stdhd); -1.664%(youtube) speedup: 18%(derf); 10%(stdhd); 8%(youtube) Change-Id: I180e3657c1e156aaa88dc7c437f8bcbd19f5caba	2013-12-13 10:47:14 -08:00
Jingning Han	3b5a90bd86	Enable adaptive pred filter type for sub8x8 This commit enables an adaptive prediction filter type selection for sub8x8 block sizes. In speed 1, it re-uses the filter type of collocated 8x8 block if it is tested in the rate-distortion optimization loop, for the sub8x8 blocks. Otherwise, it runs the normal test over all the three filter types. In speed 2, it re-uses the 8x8 block's prediction filter type, if available. Otherwise, force it to be EIGHTTAP. Compression and speed performance wise: speed 1 derf -0.266% yt -0.138% bus at 2000 kbps: 33766ms -> 30451ms (10% speed-up) football at 600 kbps: 48173ms -> 43786ms (9% speed-up) speed 2 derf -0.026% yt +0.134% bus at 2000 kbps: 18973ms -> 17698ms (6% speed-up) football at 600 kbps: 26748ms -> 25096ms (6% speed-up) Change-Id: I77e097533b969fd3472147225fa79fc98095d342	2013-12-12 17:54:34 -08:00
Deb Mukherjee	7edd5170b5	Merge "Changes interfaces to vp9_get_compressed_data fn"	2013-12-11 15:50:40 -08:00
Dmitry Kovalev	efe5b28c09	Reusing nmv_frame_counts from FRAME_COUNTS in encoder. Change-Id: Iadf2fcc9a5bfa5d02fc166f31963be1cc814831c	2013-12-11 15:16:10 -08:00
Deb Mukherjee	e33855cc47	Changes interfaces to vp9_get_compressed_data fn Silences some lint warnings in previous patches Change-Id: I04bf47ebe7e63a95fd322719a3154e589c115d78	2013-12-11 14:22:51 -08:00
Jingning Han	f92b5842bf	Merge "Full range motion search for regular block sizes"	2013-12-09 16:12:35 -08:00
Dmitry Kovalev	a19d694f09	Merge "Removing BLOCK_TYPES and adding PLANE_TYPES constant instead."	2013-12-07 02:20:41 -08:00
Alex Converse	0428579b3d	Have check_initial_width() take subsampling as arguments directly. This way it doesn't need to derive subsampling differently for each caller. Change-Id: I186aa7a84d315b796dcf2fdde5468ec12b3a59e3	2013-12-06 21:43:05 -08:00
Jingning Han	b295092b8f	Full range motion search for regular block sizes Add a full range motion search for regular block sizes. This runs exhaustive search within the given reference area. This commit further optimizes the search process by combining 4 points test into one pipeline, which gives 30% speed-up as compared to run each individual point at a time. This full range search serves as a best possible motion search reference. When replacing the diamond search with full range search, the speed 0 runtime of bus CIF at 2000 kbps goes from 153872ms to 623051ms. The compression performance compared to speed 0 setting gains 0.585% for derf set. Change-Id: Ieef1225216b0b86b4ac4872fa7fb9e18bf2eabb3	2013-12-06 12:24:53 -08:00
Dmitry Kovalev	d6b159d4a6	Removing BLOCK_TYPES and adding PLANE_TYPES constant instead. Change-Id: Ic3bb862e93aedf6a489a33ea6f7e5097d96855ee	2013-12-06 10:54:00 -08:00
Yaowu Xu	2dd730ccb3	Merge "Remove rate correction factor."	2013-12-06 10:39:51 -08:00
Dmitry Kovalev	8eac2ca840	Merge "Renaming constants."	2013-12-06 09:55:02 -08:00
Paul Wilkins	570b6d25c0	Remove rate correction factor. Removed an adaptive rate correction factor that was having a negative impact on quality in many clips. This factor was influencing the Q range available to each frame independently of the bits allocated to each. Average results with DISABLE_RC_LONG_TERM_MEM. derf +0.199, -0.059. yt +3.957, +3.798 std hd +1.577, +2.140 yt hd +4.127, +4.513 Average results without DISABLE_RC_LONG_TERM_MEM derf -0.628, -0.665 yt +3.432, +3.015 std hd -0.105, +0.153 yt hd +3.432, +3.015 Change-Id: I45bab6b606f49a442e7b27a6d631f3ffd843bbce	2013-12-06 16:57:16 +00:00
Dmitry Kovalev	d72c847fe8	Merge "Renaming PREV_COEF_CONTEXTS to COEFF_CONTEXTS."	2013-12-05 17:54:44 -08:00
Dmitry Kovalev	377fa8aff8	Renaming PREV_COEF_CONTEXTS to COEFF_CONTEXTS. Also adding BAND_COEFF_CONTEXTS macro to simplify for loop logic. Change-Id: I12a78a49cf1addf81e6b3fe2a3736ec2b79bd79e	2013-12-05 17:08:06 -08:00
Deb Mukherjee	8de1d8bfe3	Merge "Further rate control cleanups"	2013-12-05 16:55:35 -08:00
Deb Mukherjee	52d273674b	Further rate control cleanups Includes various cleanups. Streamlines the interfaces so that all rate control state updates happen in the vp9_rc_postencode_update() function. This will hopefully make it easier to support multiple rate control schemes. Removes some unnecessary code, which in rare cases can casue a difference in the constrained quality mode output, but other than that there is no bitstream change yet. Change-Id: I3198cc37249932feea1e3691c0b2650e7b0c22fc	2013-12-05 16:31:04 -08:00
Dmitry Kovalev	0d4b8d7e43	Renaming constants. NUM_YV12_BUFFERS => FRAME_BUFFERS ALLOWED_REFS_PER_FRAME => REFS_PER_FRAME NUM_REF_FRAMES_LOG2 => REF_FRAMES_LOG2 NUM_REF_FRAMES => REF_FRAMES NUM_FRAME_CONTEXTS_LOG2 => FRAME_CONTEXTS_LOG2 NUM_FRAME_CONTEXTS => FRAME_CONTEXTS Change-Id: I4e1ada08f25d8fa30fdf03aebe1b1c9df0f87e63	2013-12-05 16:23:09 -08:00
Dmitry Kovalev	3712b58c2f	Merge "Cleaning up vp9_entropy.h file."	2013-12-04 16:46:41 -08:00
Adrian Grange	584c72992a	Merge "Change default behavior to assume sampled chroma"	2013-12-04 09:35:14 -08:00
Dmitry Kovalev	8e89e2f2e0	Cleaning up vp9_entropy.h file. Renaming constants for consistency: DCT_VAL_CATEGORY1 => CATEGORY1_TOKEN DCT_VAL_CATEGORY2 => CATEGORY2_TOKEN DCT_VAL_CATEGORY3 => CATEGORY3_TOKEN DCT_VAL_CATEGORY4 => CATEGORY4_TOKEN DCT_VAL_CATEGORY5 => CATEGORY5_TOKEN DCT_VAL_CATEGORY6 => CATEGORY6_TOKEN DCT_EOB_TOKEN => EOB_TOKEN DCT_EOB_MODEL_TOKEN => EOB_MODEL_TOKEN MAX_ENTROPY_TOKENS => ENTROPY_TOKENS Moving constants: INTER_MODE_CONTEXTS from vp9_entropy.h to vp9_blockd.h. EOSB_TOKEN from vp9_entropy.h to vp9_tokenize.h Change-Id: I5fcbf081318e1d365792b6d290a930c6cb0f3fc2	2013-12-03 17:23:03 -08:00
Jingning Han	3c34619125	Fix initialization order for the encoder This commit makes the coefficient tree initialized prior to token initialization, where the coefficient costs are filled out according to the probabilities associated with coefficient value categories. Change-Id: If4e89c3923058376f8382c683fe4a225a4a38af3	2013-12-03 15:29:24 -08:00
Jingning Han	b88b49a7bc	Merge "Fix intra prediction ref selection in skip_encode"	2013-12-03 09:47:41 -08:00
Paul Wilkins	8a4310b160	Merge "Fix use_uv_intra_estimate in rd loop"	2013-12-03 04:30:50 -08:00
Jingning Han	f01ad926d0	Fix intra prediction ref selection in skip_encode This commit fixes the intra prediction reference source selection in the settings of skip_encode. Use original boundary pixels as prediction reference, when the inverse transform and reconstruction are skipped in the per block size rate-distortion optimization loop. Change-Id: I36081aa30aa46e203e0e6f4e8a420fd08269469a	2013-12-02 18:48:51 -08:00
Jingning Han	9f81a50c85	Fix use_uv_intra_estimate in rd loop This commit fixes the use of uv_intra_estimate by properly restoring the mode_info struct required by rd_pick_intra_sbuv_mode. Change-Id: I6a156d79533c4e2e60dfd3b8c5bb0a42a8eca280	2013-12-02 17:30:41 -08:00
Dmitry Kovalev	862c22cf7d	Merge "Moving token-encoding related stuff from common to encoder."	2013-12-02 10:32:04 -08:00
Deb Mukherjee	a622ed554f	Merge "Continued rate control clean-ups"	2013-11-27 12:04:38 -08:00
Deb Mukherjee	d17ac4feb2	Continued rate control clean-ups Moves all post encode rate control updates to a separate function plus other cleanups. Change-Id: I70e8eccf666c88d8b649b969997fd84d27e4baaa	2013-11-27 11:34:48 -08:00
Dmitry Kovalev	f9da823216	Moving token-encoding related stuff from common to encoder. Change-Id: I0e59d320407b3bed0ba3622a7b29975f6fad7ebf	2013-11-27 11:27:57 -08:00
Dmitry Kovalev	f4bf712fbb	Moving mode encodings from common to encoder + cleanup. Change-Id: I248ccb1532e2cd95314d0b95108f2c2e71cf084f	2013-11-26 14:53:17 -08:00
Deb Mukherjee	65f14b0067	Merge "Some cleanups on rate control"	2013-11-26 09:34:20 -08:00
Deb Mukherjee	25f1195a25	Some cleanups on rate control Removes the active_worst_qchanged variable since it is never set to 1. Change-Id: I29a291fd1068fd9b504a2db7768d45644c1eae3e	2013-11-25 18:58:45 -08:00
Dmitry Kovalev	56d048c412	Moving mv entropy encodings calculation to the encoder side. Moved arrays: vp9_mv_joint_encodings vp9_mv_class_encodings vp9_mv_class0_encodings vp9_mv_fp_encodings Change-Id: Iaf5008c579fcbd6d77fdd81d1aef8c71b5f308b7	2013-11-25 16:36:28 -08:00
Paul Wilkins	644bd87e8e	In frame Q adjustment experiment. The idea here is to allow "in frame" adjustment of the final Q value used to encode each SB64, using segmentation. There is also adjustment of the rd mult in regions of overspend. Activated using aq_mode=2 Change-Id: I2f140cd898c9f877c32cd6d2e667f5e11ada4b1c	2013-11-25 10:22:55 -08:00
Jingning Han	12e5ec6aa8	Merge "Separate setup_scale_factor/extend_frame_borders"	2013-11-25 09:14:46 -08:00
Adrian Grange	3173c21909	Change default behavior to assume sampled chroma When calling check_initial_width through vp9_set_size_literal the function was defaulting to using non-subsampled chroma. This patch changes the default to assume sampled chroma as an interim solution until complete support for other color formats is added. Change-Id: Id8e7e919b350e3473dfdf7551af6fd0716478b04	2013-11-25 08:59:29 -08:00
Dmitry Kovalev	75e4377d81	Using partition counts from FRAME_COUNTS struct in the encoder. Change-Id: I6c3d47b00acabe7ffba22ffc73741173aa9a0bff	2013-11-22 14:26:39 -08:00
Jingning Han	86d2a9b978	Separate setup_scale_factor/extend_frame_borders This commit takes out vp9_extend_frame_borders from vp9_setup_scale_factors. The refactoring is for the preparation of the use of lazy border extension at decoder. This makes it necessary to handle border extension separately at encoder/decoder. The use of vp9_extend_frame_borders will be removed, when lazy border extension is ready. Change-Id: Ia3baba3d179d5f11eee1634f19b3b319d2a59186	2013-11-22 12:02:08 -08:00
Deb Mukherjee	5576a4e1cb	Merge "Refactoring of rate control - part 1"	2013-11-22 08:06:48 -08:00
Deb Mukherjee	f1781e86b7	Refactoring of rate control - part 1 Moves all rate control variables to a separate structure, removes some currently unused variables, moves some rate control functions to vp9_ratectrl.c, and splits the encode_frame_to_data_rate function. Change-Id: I4ed54c24764b3b6de2dd676484f01473724ab52b	2013-11-22 07:07:24 -08:00
Dmitry Kovalev	87ff7f2af3	Removing old code. Change-Id: I67d1681c7b17661deb792c5e6a9e2014a73ff9b7	2013-11-20 14:05:21 -08:00
Guillaume Martres	b00057c88a	Merge "vpxenc: add --aq-mode flag to control adaptive quantization"	2013-11-20 08:13:28 -08:00
Guillaume Martres	17084657e6	vpxenc: add --aq-mode flag to control adaptive quantization Change-Id: I57e1ad4bed3487df12893ced77c49093f8755706	2013-11-15 19:42:20 +01:00
Marco Paniconi	b6ca9d917d	Merge "For CBR, keep rate-correction damping factor to 2."	2013-11-14 08:11:43 -08:00
Deb Mukherjee	cfcd5c4f61	Simplifies band-getting with a static array Simplifies the code by implementing band mapping with static arrays. A lot of the code complexity introduced in a previous patch disappears. Change-Id: Ia3fac36e594fb5ad2d55ae141c58bba4c55c2d28	2013-11-13 22:15:16 -08:00
Marco Paniconi	9977332615	For CBR, keep rate-correction damping factor to 2. The switch to the rate-correction damping factor in https://gerrit.chromium.org/gerrit/#/c/67536/ was not conditioned on CBR mode. Change-Id: I2326704e8ac030a4f7b592dd3fedb94c7dd0644d	2013-11-13 16:14:31 -08:00
Jingning Han	b6b9143218	Dual buffer encoding for intra modes Overall change (using dual buffer scheme for superblocks of both inter and intra modes) reduces speed 2 runtime: bluesky_1080p at 6000kbps: 263553ms -> 257441ms riverbed_1080p at 8000kbps: 233230ms -> 225308ms. Change-Id: Idf8d70f768a4b0d97b2a8506372c57b7b4022119	2013-11-13 12:57:03 -08:00
Jingning Han	e69461593d	Merge "Enable dual buffer rd search and encoding scheme"	2013-11-12 18:11:41 -08:00
Dmitry Kovalev	3a2ea76469	Merge "Moving {sb, mb, b, ab}_index from MACROBLOCKD to MACROBLOCK."	2013-11-12 15:59:28 -08:00
Deb Mukherjee	5ade423774	Removes conditional statements from band getting Implements scan order to band map with arrays in both the encoder and decoder to remove conditional statements. Encoding seems to be about 1% faster at speed 0, tested on football. Decoding seems to be about 0.5-1% faster on a set of 25 videos. Change-Id: Idb233ca0b9e0efd790e30880642e8717e1c5c8dd	2013-11-12 10:13:27 -08:00
Jingning Han	34b6abefa2	Enable dual buffer rd search and encoding scheme This commit enables the dual buffer rate-distortion optimization and encoding scheme. It stacks the original transform coefficients, quantized levels, and reconstructed coefficients, in the rate- distortion optimization search process, hence eliminates the need to re-run residual generation, forward transform, and quantization in the encoding stage. Change-Id: I011bfad3a59a380a869ee552e91dae0394ec492e	2013-11-11 18:32:55 -08:00
Jingning Han	3b3aea6834	Allocate dual buffer sets for encoding Allocate memory space of dual buffer sets that store the coeff, qcoeff, dqcoeff, and eobs. Connect the pointers of macroblock_plane and macroblockd_plane to the actual buffer in use accordingly. Change-Id: I2f0b5f482ca879fae39095013eaf8901db20a5a4	2013-11-11 16:24:39 -08:00
Dmitry Kovalev	3551e25099	Moving {sb, mb, b, ab}_index from MACROBLOCKD to MACROBLOCK. We use {sb, mb, b, ab}_index only inside encoder, so moving them into appropriate data structure. Change-Id: Ib5c1036716354d9d321e11a60c1634c1cb8f9716	2013-11-11 15:58:57 -08:00
Jingning Han	d8b4c79270	Decouple macroblockd_plane buffer usage Make the macroblockd_plane contain dynamic buffer pointers instead static pointers to the memory space allocated therein. The decoder uses the buffer allocated in pbi, while encoder will use a dual buffer approach for rate-distortion optimization search. Change-Id: Ie6f24be2dcda35df7c15b4014e5ccf236fb3f76c	2013-11-11 15:26:10 -08:00
Ivan Maltz	741c14fcf0	Merge "Move SVC per-frame loop from sample app into libvpx proper"	2013-11-06 17:24:05 -08:00
Ivan Maltz	1ed0e1beb5	Move SVC per-frame loop from sample app into libvpx proper SVC multiple layer per frame encoding is invoked with vpx_svc_init and vpx_svc_encode. These interfaces are designed to be invoked from ffmpeg. Additional improvements: - make dummy frame handling a bit more explicit - fixed bug with single layer encodes - track individual frame sizes and psnrs instead of averages - parameterized quantizer, 16th scalefactors, more logging, - enabled single layer encodes to generate baseline - include new mode for 3 layer I frame with 5 total layers Change-Id: I46cfa600d102e208c6af8acd6132e0cc25cda8d4	2013-11-06 14:49:27 -08:00
Deb Mukherjee	be8a4cbbdd	Merge "Remove one shot q experiment"	2013-11-05 09:29:31 -08:00
Adrian Grange	44e25155f7	Remove unused members from VP9_COMP Removed: goldfreq, avg_encode_time, avg_pick_mode_time, cpu_freq, interquantizer member variables from VP9_COMP since they are no longer used in the code. Change-Id: I010a82c217d0da03c3f53d1858d3462190c12dcf	2013-11-04 12:32:17 -08:00
Adrian Grange	a0a6590e0f	Remove unused member variables from VP9_COMP Removed three members from the VP9_COMP data structure: inter_zz_count, gf_bad_count, gf_update_recommended. These were part of the VP8 real-time mode implementation that was removed from the initial VP9 codecbase. Change-Id: I866b083b88ef02c74837277d50ce532ca88492f3	2013-11-04 11:01:43 -08:00
Deb Mukherjee	1df7ef2974	Remove one shot q experiment The experiment is no longer used and can be removed. Change-Id: I9feab378fc895c120aa375353c68f93cad090609	2013-10-31 00:20:55 -07:00
Marco Paniconi	b26ce8b1be	Updates to 1-pass: -Don't reduce maxQ for gold/alt in CBR mode. -Fix to min/maxQ for first/initial key frame. -Add more speeds to datarate test and reduce the starting bitrate for test. Change-Id: Id2a333d76dd3f6a51b322ca984588e2a22159c58	2013-10-30 16:52:46 -07:00
James Zern	3ffa41aae3	Merge changes If9b16f7d,I75aab21c,I9cbb768c,If5cea3d3,I96940657,I025595d8,Ie0bc3935,I3ebb172d * changes: vp9: remove partition+entropy contexts from common vp9: add above/left_context to MACROBLOCKD vp9: add above/left_seg_context to MACROBLOCKD vp9: add above/left_context to encoder vp9: add above/left_seg_context to encoder vp9: pass entropy context directly to set_skip_context vp9: pass context directly to partition functions vp9/decode: add alloc_tile_storage()	2013-10-28 12:45:11 -07:00
James Zern	ce2c337261	vp9: add above/left_context to encoder Change-Id: If5cea3d389bb1135ee490d273e57cc2c43325d01	2013-10-25 22:01:14 +02:00
James Zern	d72dfab296	vp9: add above/left_seg_context to encoder Change-Id: I969406574c6658936e9f6db5752f1b295025aab5	2013-10-25 22:01:14 +02:00
Dmitry Kovalev	237ce8724a	Adding get_frame_new_buffer() function to replace duplicated code. Change-Id: I6e0e19231a48364c1de7dfab730b121ab227f111	2013-10-24 12:20:35 -07:00
Dmitry Kovalev	fd724f13b0	Renaming vp9_short_fdct4x4 and vp9_short_walsh4x4. For consistency with idct function names. Renames: vp9_short_fdct4x4 -> vp9_fdct4x4 vp9_short_walsh4x4 -> vp9_fwht4x4 Change-Id: Id15497cc1270acca626447d846f0ce9199770f58	2013-10-23 14:28:39 -07:00
Dmitry Kovalev	ec414372e8	Removing quantize_b_4x4 function pointer. The pointer was asigned only once with vp9_regular_quantize_b_4x4, calling this function directly now. Also removing unused declarations: prototype_quantize_block prototype_quantize_block_pair prototype_quantize_mb vp9_regular_quantize_b_4x4_pair vp9_regular_quantize_b_8x8 Change-Id: I14325bc2f082336820671eafbc06126651b79f73	2013-10-22 13:09:36 -07:00
Dmitry Kovalev	190c2b4591	Using stride (# of elements) instead of pitch (bytes) in fdct4x4. Just making fdct consistent with iht/idct/fht functions which all use stride (# of elements) as input argument. Change-Id: I0ba3c52513a5fdd194f1e7e2901092671398985b	2013-10-21 15:27:35 -07:00
Jingning Han	deb10ac6f9	Merge "Make memory alloc in pick_mode_context bsize aware"	2013-10-21 11:45:59 -07:00
Dmitry Kovalev	33a29f3c35	Merge "Moving allow_high_precision_mv from MACROBLOCKD to VP9_COMMON."	2013-10-21 10:55:02 -07:00
Dmitry Kovalev	d1b65c6bda	Moving allow_high_precision_mv from MACROBLOCKD to VP9_COMMON. This value is a global frame-level flag, not a macroblock-level. Change-Id: Ie8c5790a931150741c2167c00c3e3dd2cf26744d	2013-10-21 10:12:14 -07:00
Paul Wilkins	eec3def7c5	Modified no memory rate control. This 2-pass rate control setting allocates bits based on first pass stats to each kf group, gf group and individual frame but does not correct the bits left and allocation after each frame. In other words it recommends a bit allocation for each frame but does not try and correct any over or under spend on a frame over the remainder of the clip. This reduces the accuracy of rate control in terms of hitting an average bitrate but prevents problems that may arise because early frames either use to many or too few bits. This mode is currently more inclined to undershoot than overshoot (particularly at higher data rates). Also minor changes to rate of adaption when recode loop is not enabled. This mode is currently enabled by default for VBR. It gives the following % performance gains. derf +0.467, +1.072 yt 2.962, 2.645 stdhd 1.682, 1.595, yt-hd 2.3, 2.174 Change-Id: I3c84a9bf8884e5b345698ff0e19187f792c2f3a0	2013-10-19 12:40:43 +01:00
Paul Wilkins	a2769bb73d	Reduced delta for kf/gf/arf when at maxq. Delta reduced because of concern about popping on some very hard clips. Also allow some frame recode at speed 2 for kf/gf/arf. Change-Id: Ib47dff42da41aa6eec83b7285fcaaca24abb851e	2013-10-19 12:24:59 +01:00
Jingning Han	72033fcff8	Make memory alloc in pick_mode_context bsize aware This commit makes the buffer allocation of zcoeff_blk array in pick_mode_context block size aware. It calculates the number of 4x4 blocks in the partition and assigns the memory space accordingly. This process (and the uninitialization) is done once for each encoding pass. It allows memory copy of smaller buffer when possible. For football at 600kbps, the runtimes improve by about 1%: speed 1, 45961ms -> 45472ms speed 2, 23863ms -> 23598ms Change-Id: Id2ca24906fa89f46fa5fe742ec4b8efc2a61f877	2013-10-18 12:42:44 -07:00
Dmitry Kovalev	01993f7d4a	Removing last_kf_gf_q member from VP9Common structure. It looks like we don't actually use this value. Change-Id: If21d52b597337e7755f7ea817824fc2b1e477a14	2013-10-16 18:01:48 -07:00
Guillaume Martres	acf0d56f0b	Get rid of "this_mi", use "mi_8x8[0]" everywhere instead The only case where they were intentionally pointing to different structures was in mbgraph, and this didn't have the expected behavior because both of these pointers are used interchangeably through the code Change-Id: I979251782f90885fe962305bcc845bc05907f80c	2013-10-16 16:24:03 -07:00
Guillaume Martres	9a03154f46	Make the static_segmentation feature work again Change-Id: I766c4b74db526efa4ff6dd2d95ef3e0beb45b6e5	2013-10-16 16:15:27 -07:00
Guillaume Martres	42bcb4a7ad	Merge "Prevent accidental changes to the previous frame mode_infos"	2013-10-16 16:07:05 -07:00
Dmitry Kovalev	ab829274b1	Inlining and removing fwd_txm16x16 and fwd_txm8x8 pointers. Change-Id: I3528ba1c3fee761918509f9d9dc2d842c69f5a44	2013-10-16 15:00:48 -07:00
Marco Paniconi	e078c3d854	Initial 1-pass. Change-Id: I58c5436f5c95f6012fb2891cd2a02f76e4870b6a	2013-10-16 12:04:29 -07:00
Guillaume Martres	e55f60240a	Implement variance-based adaptive quantization This should be similar to what x264 does with --aq-mode 1. It works well with clips like parkjoy and touhou (http://x264.nl/developers/Dark_Shikari/LosslessTouhou.mkv). At low bitrates, the segmentation signaling overhead may negate the benefits of this feature. (PGW) Default changed to feature OFF to allow provisional merge. Change-Id: I938abf9bb487e1d4ad3b0264ea03d9826275c70b	2013-10-16 11:55:13 +01:00
Adrian Grange	12b2c712ca	Merge "Updated encoder to handle intra-only frames"	2013-10-15 17:19:28 -07:00
Jingning Han	9b05f23e05	Merge "Make vp9_zero use cases of consistent format"	2013-10-15 16:49:05 -07:00
Alexander Voronov	d6a59fb12c	Updated encoder to handle intra-only frames Updated the encoder to handle frames that are coded intra-only. Intra-only frames must be non-showable, that is, the "show frame" flag must be set to 0 in the frame header. Tested by forcing the ARF frames to be coded intra- only. Note: The rate control code will need to be modified to account for intra-only frames better than they are currently handled. Change-Id: I6a9dd5337deddcecc599d3a44a7431909ed21079	2013-10-15 16:44:02 -07:00
Jingning Han	c8e48f4b02	Make vp9_zero use cases of consistent format Remove the semicolon in the definition of vp9_zero macro. Make all the use cases of vp9_zero of consistent format. Change-Id: Ibaf9751e8595872b12766381a93d185a4d90df8f	2013-10-15 16:12:21 -07:00
Dmitry Kovalev	a4585285ed	Removing unused 8x4 transform from the encoder. Change-Id: Icbcf68b5b685a56f255ebc3859c9692accdadf9e	2013-10-15 11:27:28 -07:00
Yaowu Xu	8b175679be	Masking intra mode choice adaptively The commit changes to mask available intra prediction modes for test based on prediction block size. With this patch, encoding time of CpuUsed 2 reduces from 10% to 20% for HD clips with a compression drop of 0.2% Change-Id: I65f320f1237c0f5ae3a355bf7caf447f55625455	2013-10-11 10:29:53 -07:00
Paul Wilkins	704028d435	Experimental rate control change. When the codec in VBR (or cq) mode hits its max q limits and is struggling to hit a target bandwidth, the bit target per frame collapses. In the first instance normal frames cap out at the maximum allowed Q and then the ARF and GFs do the same. This latter behavior is not generally desirable as GFs and ARFs are only effective from a quality and data rate perspective if they have at lease some level of -Q delta compared to the surrounding frames. In this patch I define a separate max Q for GFs and ARFs that is derived from but somewhat lower than that defined for normal frames. In effect there is a minimum Q delta that will always be available for GFs and ARFs regardless of the target rate and MAXQ setting. This may of course mean that the absolute lowest rate obtainable for a given clip is somewhat higher. Change-Id: I268868b28401900d0cd87e51e609cd3b784ab54a	2013-10-11 13:40:54 +01:00
Paul Wilkins	8b989f5b23	Disable recode loop. For VBR coding disable the recode loop for speeds > 0. Results pending. Change-Id: I2cd9a87c3fcbe39c05b954798d0671a4ca62c37f	2013-10-11 13:38:52 +01:00
Guillaume Martres	b364176c08	Prevent accidental changes to the previous frame mode_infos This is needed to fix mbgraph but shouldn't affect anything else Change-Id: I2f515052f62e348cd3794b7ff0c139802225ea95	2013-10-10 12:18:12 -07:00
Dmitry Kovalev	1e8fc24af8	Merge "Removing inv_txm4x4_1_add and inv_txm4x4_add function pointers."	2013-10-10 10:49:27 -07:00
Jingning Han	4793324c16	Merge "Allow sub8x8 intra modes test for alt frame coding"	2013-10-10 09:00:08 -07:00
Jingning Han	03fe08ca30	Deprecate the use of PARTITION_INFO from encoder Use b_mode_info to store the inter prediction mode of sub8x8 block, in replacement of the use of partition_info. Remove redundant buffer update for partition_info. For bus_cif at 2000 kbps, this seem to make speed 0 about 1% faster. Change-Id: Id1b3be45e75a24fb4b42335ac480c23e440978f6	2013-10-09 09:23:52 -07:00
Dmitry Kovalev	c983c966cb	Removing inv_txm4x4_1_add and inv_txm4x4_add function pointers. We already have itxm_add member in MACROBLOCKD structure. Both inv_txm4x4_1_add and inv_txm4x4_add are just its special cases for different eob values. But eob logic is already implemented in vp9_iwht4x4_add and vp9_idct4x4_add (that's why also removing inverse_transform_b_4x4_add). Change-Id: I80bec9b6f7d40c5e5033c613faca5c819c3e6326	2013-10-08 11:27:56 -07:00
Yaowu Xu	e29137df05	Change to allow less rectangular partion check For CpuUsed 1 & 2, this commit allow to skip retangular partition check when NONE is better than SPLIT. It also changed to allow such logic on alt ref frame coding rather than use square partition all them. The change has gain compressio about .3% on yt and ythd for both 1&2, It helped .6% compression on cif and stdhd for both CpuUsed 1&2. Change-Id: I814b653baf89f59acd20e042629a12938a1bd4e5	2013-10-08 08:12:56 -07:00
Jim Bankoski	2b491c19b8	Merge "cpplint errors in vp9_onyx_if.h"	2013-10-07 14:47:21 -07:00
Jim Bankoski	7eb7dd2fed	cpplint errors in vp9_onyx_if.h Slightly bigger change -> broke up encode_frame_to_datarate, lots of line length fixes. Change-Id: I7c53325e954de130f3fe1a6656626efc6705be82	2013-10-07 13:57:20 -07:00
Dmitry Kovalev	9dba044be2	Merge "Giving consistent names to IDCT/IWHT functions."	2013-10-05 23:44:05 -07:00
Jingning Han	0d0ed6a29b	Allow sub8x8 intra modes test for alt frame coding This commit allows sub8x8 intra modes test in the rate-distortion loop for hd sequences in speed 1 and 2. For sequence y90n of hd set at 8000 kbps, speed 2 runtime goes from 207s to 210s. For ped_1080p at 3000 kbps, speed 2 runtim goes from 336s to 337s. Both are running with 300 frames. This improves compression performance by 0.24% for stdhd and 0.32% for hd. Change-Id: I173ca38a6411565ae6cfadd184c42b2070c5de1f	2013-10-04 19:13:00 -07:00
Dmitry Kovalev	3a0602578e	Giving consistent names to IDCT/IWHT functions. The idea is to have the following names for each transform size: vp9_idct4x4_add vp9_idct4x4_1_add vp9_idct4x4_10_add vp9_idct4x4_16_add vp9_idct8x8_add vp9_idct8x8_1_add vp9_idct8x8_10_add vp9_idct8x8_64_add etc for 16x16, 32x32 The actual list of renames in this patch: vp9_idct_add_lossless -> vp9_iwht4x4_add vp9_short_iwalsh4x4_add -> vp9_iwht4x4_16_add vp9_short_iwalsh4x4_1_add -> vp9_iwht4x4_1_add vp9_idct_add -> vp9_idct4x4_add vp9_short_idct4x4_add -> vp9_idct4x4_16_add vp9_short_idct4x4_1_add -> vp9_idct4x4_1_add Change-Id: I6f43f7437c68dd30cdd05d72e213765578ed30b1	2013-10-04 14:17:06 -07:00
Paul Wilkins	44e039b4f5	Further clean up of speed 4 Speed 4 still does not give a big gain over speed 3. This just cleans it up a little from the last patch and comments out features that do not seem to be giving much benefit. Change-Id: I5f366e6160e1dbe5dc45cf5eb90cc02712baa1b6	2013-10-04 16:57:24 +01:00
Paul Wilkins	de6ecc5ac3	Selective masking of split modes. Allow selective masking of individual split modes rather than just a single on / off flag. For speed 2 recovers the large speed loss seen for some derf clips in change Ie6bdfa0a370148dd60bd800961077f7e97e67dd4 and a small quality gain. For speed 1 10 % speed increase observed locally on some derf clips for minimal quality change. Change-Id: If86191087b93cbc05351c26c60c7933e2149e485	2013-10-04 14:20:58 +01:00
Paul Wilkins	03dd2818e4	Missing threshold case for disable split. In relation to change: Refactor inter mode rate-distortion search Ie6bdfa0a370148dd60bd800961077f7e97e67dd4 sf->thresh_mult_sub8x8[THR_INTRA] = INT_MAX missing; Change-Id: Ia86b68a5073368a3e2ca124a27b632243b525c8b	2013-10-04 11:54:24 +01:00
Jingning Han	11abab356e	Refactor inter mode rate-distortion search This commit separates the rate-distortion optimization loop of superblocks from that of sub8x8 blocks. This allows better design rate-distortion optimization search loop for each setting. It also removes the use of SPLITMV and I4X4_PRED therein. No performance change in speed 0 settings. For bus@CIF at 2000kbps, the speed 1 runtime goes from 48009ms to 43894ms (about 10% faster). The overall compression performance on derf changed by -0.021%. Speed 2 runtime goes from 27114ms to 28700ms (6% slower), while the overall coding efficiency goes up by 1.629% for derf, 1.236% for yt. Change-Id: Ie6bdfa0a370148dd60bd800961077f7e97e67dd4	2013-10-03 11:36:49 -07:00
Paul Wilkins	6253cc9279	Speed setting review. Substantial reworking of the speed vs quality trade offs for speed 1 and 2. In this patch I am attempting to freeze the "quality" meaning of speeds 1 and 2 relative to speed 0 so that in future we can better evaluate progress. I am targeting : Speed 1 quality ~-5% vs speed 0. Speed 2 quality ~-10% vs speed 0 It is inevitable that quality will still fluctuate a little as we adjust settings and add new features, but we will attempt to keep as close as possible to these values. Above speed 2 things will remain a bit more fluid for now. In this patch speed 1 is approximately 4-5x as fast as speed 0. This is similar to before but the quality hit is a lot less. Likewise speed 2 is approximately 2x as fast as speed 1 but is similar in quality to the previous speed 1 configuration. Also slight change to behavior of FLAG_EARLY_TERMINATE to insure all reference frames get at least one rd test. Important for very low variance regions. WIP :- Added a new speed level with old speed 4 becoming speed 5. Speed 3 and 4 tradeoffs still WIP Change-Id: Ic7a38dd7b5b63ab1501f9352411972f480ac6264	2013-10-03 10:23:28 +01:00
Paul Wilkins	ece99b3da0	Merge "Improved auto_partition_range."	2013-10-03 02:06:13 -07:00
Jingning Han	54bc73151b	Deprecate unused mode count variables Remove mode_check_freq and mode_test_hit_counts from VP9_COMP. Change-Id: Iabfd9f841444cd9bf19ac761a9795f140082ce0b	2013-10-02 11:07:14 -07:00
Paul Wilkins	d12a502ef9	Merge "Alter Speed 3."	2013-09-30 09:12:28 -07:00
Paul Wilkins	65b93c7e52	Improved auto_partition_range. The code now takes into account temporal and spatial information to determine the partition size range, but the frequency counts have been removed. The net effect is similar in quality but about 10% faster. Change-Id: I39a513fb79cec9177b73b2a7218f0da70963ae95	2013-09-30 11:32:57 +01:00
Paul Wilkins	a76caa7ff4	Alter Speed 3. This patch deletes the variance based speed three partitioning. Speed 3 now uses the same partitioning method as speed 2 but with some stricter conditions. The speed and quality are now somewhere between speeds 2 and 4 whereas before it was worse in both than speed 4. Change-Id: Ia142e7007299d79db3ceee6ca8670540db6f7a41	2013-09-30 11:26:46 +01:00
Deb Mukherjee	80d582239e	Some minor changes/cleanups in rate control Some small changes to the quantizer mapping functions. Also includes some cleanups. Change-Id: I9dea29b24015f6e6697012a0e4d8983049d8e5c7 Results: derfraw300: +0.106% stdhdraw250: +0.139%	2013-09-27 13:57:42 -07:00
Deb Mukherjee	b7a93578e5	Small tweak in the constant quality parameter Improves results a little. Change-Id: I7bcac02dbb65b43a993445cf557c520197114e5c	2013-09-24 09:09:35 -07:00
Deb Mukherjee	d11221f433	Improves constant qual, constrained qual turned on Adds modeled functions to decide the qp for altref frames in constant q mode similar to other functions in use in bitrate mode. Also turns on the constrained quality mode (end-usage=2) option which was turned off before. Basic testing shows the mode works in principle, to cap bitrate to the target-bitrate specified, while allowing lower bitrate depending on the cq-level specified. The mode will need to be improved over time. Results for constant quality vs bitrate control mode: derfraw300/fullderfraw: +3.0% at constant quality over bitrate control. fullstdhdraw: +4.341% stdhdraw250: +5.361% Change-Id: If5027c9ec66c8e88d33e47062c6cb84a07b1cda9	2013-09-22 23:04:50 -07:00
Paul Wilkins	cb50dc7f33	Minor clean up. Removed some unused code and minor cleanup / reordering. Change-Id: I4083ae56aeb8edfe9b85aa2f42a16aa28d19da94	2013-09-16 13:45:20 +01:00
Paul Wilkins	3b01778450	Adjustment to mode_skip_start. Corrected values relating to modified mode order. Change-Id: I24fccba3af4bc16721d5e7e51888a66305bfa7fe	2013-09-16 13:44:48 +01:00
Jingning Han	e8a967d960	Merge "Adaptive motion search control"	2013-09-13 14:43:23 -07:00
Jingning Han	c4826c5941	Adaptive motion search control This commit enables adaptive constraint on motion search range for smaller partitions, given the motion vectors of collocated larger partition as a candidate initial search point. It makes speed 0 runtime of bus at CIF and 2000 kbps goes from 167s down to 162s (3% speed-up), at 0.01dB performance gains. In the settings of speed 1, this makes the runtime goes from 33687 ms to 32142 ms (4.5% speed-up), at 0.03dB performance gains. Compression performance wise, it gains at speed 1: derf 0.118% yt 0.237% hd 0.203% stdhd 0.438% Change-Id: Ic8b34c67810d9504a9579bef2825d3fa54b69454	2013-09-13 13:58:10 -07:00
Deb Mukherjee	0c3038234d	Merge "Clean up of the search best filter speed feature"	2013-09-13 11:03:59 -07:00
Scott LaVarnway	8fc95a1b11	Merge "New mode_info_context storage -- undo revert"	2013-09-13 08:56:20 -07:00
Deb Mukherjee	b964646756	Clean up of the search best filter speed feature Removes this speed feature since it is very slow and unlikely to be used in practice. This cleanup removes a bunch of unnecessary complications in the outer encode loop. Change-Id: I3c66ef1ca924fbfad7dadff297c9e7f652d308a1	2013-09-11 15:16:36 -07:00
Deb Mukherjee	69fe840ec4	Changes in speed 2 settings Propose some changes to the speed 2 settings to improve quality. In particular, turns off the adjust_thresholds_by_speed feature which improves results by 6%. Also removes the code for adjust_thresholds_by_speed since it conflicts with the adaptive rd thresh feature. Overall, with this change speed 2 is -15.2% from speed 0 settings, on derf, which is significantly better than -21.6% down before. Change-Id: I6e90a563470979eb0c258ec32d6183ed7ce9a505	2013-09-11 10:54:07 -07:00
Scott LaVarnway	ac6093d179	New mode_info_context storage -- undo revert mode_info_context was stored as a grid of MODE_INFO structs. The grid now constists of pointers to MODE_INFO structs. The MODE_INFO structs are now stored as a stream (decoder only), eliminating unnecessary copies and is a little more cache friendly. Change-Id: I031d376284c6eb98a38ad5595b797f048a6cfc0d	2013-09-11 13:45:44 -04:00
Deb Mukherjee	3d22d3ae0c	Merge "Small tweaks on the constant quality mode"	2013-09-10 11:16:47 -07:00
Deb Mukherjee	09830aa0ea	Small tweaks on the constant quality mode Improves results a little. derf is now +1.078% over bitrate control. Change-Id: I4812136f3e67be21d14ec089419976a32a841785	2013-09-10 10:16:19 -07:00
Yunqing Wang	939791a129	Modify encode breakout for static frames Thank Paul for the suggestions. While turning on static-thresh for static-image videos, a big jump on bitrate was seen. In this patch, we detected static frames in the video using first-pass stats. For different cases, disable encode breakout or reduce encode breakout threshold to limit the skipping. More modification need be done to break incorrect partition picking pattern for static frames while skipping happens. Change-Id: Ia25f47041af0f04e229c70a0185e12b0ffa6047f	2013-09-10 09:06:03 -07:00
Paul Wilkins	4f660cc018	Modified mode skip functionality. A previous speed feature skipped modes not used in earlier partitions but this not longer worked as intended following changes to the partition coding order and in conjunction with some other speed features (Especially speed 2 and above). This modified mode skip feature sets a mask after the first X modes have been tested in each partition depending on the reference frame of the current best case. This patch also makes some changes to the order modes are tested to fit better with this skip functionality. Initial testing suggests speed and rd hit count improvements of up to 20% at speed 1. Quality results. (derf -1.9%, std hd +0.23%). Change-Id: Idd8efa656cbc0c28f06d09690984c1f18b1115e1	2013-09-10 13:30:10 +01:00
Ivan Maltz	20abe595ec	Merge "API extensions and sample app for spacial scalable encoder"	2013-09-09 16:57:01 -07:00
Ivan Maltz	01b35c3c16	API extensions and sample app for spacial scalable encoder Sample app: vp9_spatial_scalable_encoder vpx_codec_control extensions: VP9E_SET_SVC VP9E_SET_WIDTH, VP9E_SET_HEIGHT, VP9E_SET_LAYER VP9E_SET_MIN_Q, VP9E_SET_MAX_Q expanded buffer size for vp9_convolve modified setting of initial width in vp9_onyx_if.c so that layer size can be set prior to initial encode Default number of layers set to 3 (VPX_SS_DEFAULT_LAYERS) Number of layers set explicitly in vpx_codec_enc_cfg.ss_number_layers Change-Id: I2c7a6fe6d665113671337032f7ad032430ac4197	2013-09-09 15:57:56 -07:00
James Zern	c1913c9cf4	Merge "Revert "New mode_info_context storage""	2013-09-09 14:38:01 -07:00
James Zern	54a03e20dd	Revert "New mode_info_context storage" This reverts commit `dae17734ec` Encode crashes, leaks and increases integer overflow errors. Change-Id: I595aa2649bb8d0b6552ff91652837a74c103fda2	2013-09-09 13:37:01 -07:00
Paul Wilkins	740acd6891	Merge "Enable kf restrictions at speed 4"	2013-09-09 05:39:13 -07:00
Jim Bankoski	e378566060	Merge "New mode_info_context storage"	2013-09-08 07:16:25 -07:00
Paul Wilkins	f15cdc7451	Enable kf restrictions at speed 4 Change-Id: I453409d3be3f5fe118b15affde45cb52184aef20	2013-09-06 11:16:04 -07:00
Deb Mukherjee	e378a89bd6	Support a constant quality mode in VP9 Adds a new end-usage option for constant quality encoding in vpx. This first version implemented for VP9, encodes all regular inter frames using the quality specified in the --cq-level= option, while encoding all key frames and golden/altref frames at a quality better than that. The current performance on derfraw300 is +0.910% up from bitrate control, but achieved without multiple recode loops per frame. The decision for qp for each altref/golden/key frame will be improved in subsequent patches based on better use of stats from the first pass. Further, the qp for regular inter frames may also be varied around the provided cq-level. Change-Id: I6c4a2a68563679d60e0616ebcb11698578615fb3	2013-09-06 10:30:53 -07:00
Scott LaVarnway	dae17734ec	New mode_info_context storage mode_info_context was stored as a grid of MODE_INFO structs. The grid now constists of a pointer to a MODE_INFO struct and a "in the image" flag. The MODE_INFO structs are now stored as a stream, eliminating unnecessary copies and is a little more cache friendly. For the test clips used, the decoder performance improved by ~4.3% (1080p) and ~9.7% (720p). Patch Set 2: Re-encoded clips with latest. Now ~1.7% (1080p) and 5.9% (720p). Change-Id: I846f29e88610fce2523ca697a9a9ef2a182e9256	2013-09-06 12:33:34 -04:00

1 2 3 4 5 ...

615 Коммитов