mozilla/aom - aom

Граф коммитов

Автор	SHA1	Сообщение	Дата
Yunqing Wang	b9f19f8917	Adjust half-pixel only search Changed motion search in vp8_find_best_half_pixel_step() to be the same as in vp8_find_best_sub_pixel_step(), which checks 5 points instead of 8 points. This only affects real-time mode with cpu-used >=9. Tests showed it gives 2% encoding speedup with a quality loss(psnr) of up to 0.5%. Change-Id: I16049cad1535002346d46cfdfad345bfc3dc5146	2011-08-03 11:51:07 -04:00
John Koleszar	56b06aef6d	Merge "configure: add --enable-static option"	2011-07-28 07:08:35 -07:00
Yunqing Wang	2f2302f8d5	Preload reference area in sub-pixel motion search (real-time mode) This change implemented same idea in change "Preload reference area to an intermediate buffer in sub-pixel motion search." The changes were made to vp8_find_best_sub_pixel_step() and vp8_find_best_half _pixel_step() functions which are called when speed >= 5. Test result (using tulip clip): 1. On Core2 Quad machine(Linux) rt mode, speed (-5 ~ -8), encoding speed gain: 2% ~ 3% rt mode, speed (-9 ~ -11), encoding speed gain: 1% ~ 2% rt mode, speed (-12 ~ -14), no noticeable encoding speed gain 2. On Xeon machine(Linux) Test on speed (-5 ~ -14) didn't show noticeable speed change. Change-Id: I21bec2d6e7fbe541fcc0f4c0366bbdf3e2076aa2	2011-07-27 14:19:10 -04:00
Yunqing Wang	f11613b620	Merge "Fix range checks in motion search"	2011-07-27 09:34:13 -07:00
Yunqing Wang	bde2afbe23	Fix range checks in motion search There were some situations that the start motion vectors were out of range. This fix adjusted range checks to make sure they are checked and clamped. Change-Id: Ife83b7fed0882bba6d1fa559b6e63c054fd5065d	2011-07-27 10:37:33 -04:00
James Zern	3a975d9489	vpxenc: cosmetics: timebase help update / spelling The timebase update fixes Issue #61. Change-Id: I425158da7ea639464f61e6dd604ac9e6c72b7266	2011-07-26 17:27:01 -07:00
John Koleszar	db8f0d2ca9	Merge "cosmetics: consistently use [u]int64_t"	2011-07-26 12:57:43 -07:00
James Zern	b45065d38b	cosmetics: consistently use [u]int64_t Removes mixed usage of (unsigned) long long and INT64. Fixes Issue #208. Change-Id: I220d3ed5ce4bb1280cd38bb3715f208ce23cf83a	2011-07-26 11:34:36 -07:00
Johann	ca7e346669	Merge ""Eliminated TOKENEXTRABITS" broke the windows build."	2011-07-26 06:34:31 -07:00
Scott LaVarnway	a11624497c	"Eliminated TOKENEXTRABITS" broke the windows build. Fixed. Change-Id: I3348e8dbcaee6ace263af413701101d77636e5df	2011-07-26 09:33:16 -04:00
James Zern	495b241fa6	configure: add --enable-static option Fixes issue #62. Change-Id: I0567cf7897c0942666c19b3231c8c3b8e9c3e7cc	2011-07-25 15:40:36 -07:00
Scott LaVarnway	4894b45ced	Merge "Eliminated TOKENEXTRABITS"	2011-07-25 14:35:58 -07:00
Scott LaVarnway	76eb402668	Eliminated TOKENEXTRABITS Noticed small performance gains, depending on material. Change-Id: I334369f6312bc19aa73481fc3f790ab181e11867	2011-07-25 17:11:24 -04:00
Yunqing Wang	5b0de48ddd	Merge "Use CONFIG_FAST_UNALIGNED consistently in codec"	2011-07-25 12:40:50 -07:00
Yunqing Wang	fe270dd527	Specify size for argument pushed to stack The change fixes building error on Win64. Change-Id: I63d25b26220c4da8a98ca2e36530cbb802468e6b	2011-07-25 11:30:45 -04:00
Yunqing Wang	65dfcf4696	Use CONFIG_FAST_UNALIGNED consistently in codec CONFIG_FAST_UNALIGNED is enabled by default. Disable it if it is not supported by hardware. Change-Id: I7d6905ed79fed918bca074bd62820b0c929d81ab	2011-07-25 10:11:24 -04:00
Johann	773bcc300d	Merge "fix sharpness bug and clean up"	2011-07-22 09:34:55 -07:00
Johann	a04ed0e8f3	fix sharpness bug and clean up sharpness was not recalculated in vp8cx_pick_filter_level_fast remove last_filter_type. all values are calculated, don't need to update the lfi data when it changes. always use cm->sharpness_level. the extra indirection was annoying. don't track last frame_type or sharpness_level manually. frame type only matters for motion search and sharpness_level is taken care of in frame_init move function declarations to their proper header Change-Id: I7ef037bd4bf8cf5e37d2d36bd03b5e22a2ad91db	2011-07-22 12:33:57 -04:00
Yunqing Wang	829179e888	Merge "Preload reference area to an intermediate buffer in sub-pixel motion search"	2011-07-22 06:56:15 -07:00
Yunqing Wang	20bd1446c0	Preload reference area to an intermediate buffer in sub-pixel motion search In sub-pixel motion search, the search range is small(+/- 3 pixels). Preload whole search area from reference buffer into a 32-byte aligned buffer. Then in search, load reference data from this buffer instead. This keeps data in cache, and reduces the crossing cache- line penalty. For tulip clip, tests on Intel Core2 Quad machine(linux) showed encoder speed improvement: 3.4% at --rt --cpu-used =-4 2.8% at --rt --cpu-used =-3 2.3% at --rt --cpu-used =-2 2.2% at --rt --cpu-used =-1 Test on Atom notebook showed only 1.1% speed improvement(speed=-4). Test on Xeon machine also showed less improvement, since unaligned data access latency is greatly reduced in newer cores. Next, I will apply similar idea to other 2 sub-pixel search functions for encoding speed > 4. Make this change exclusively for x86 platforms. Change-Id: Ia7bb9f56169eac0f01009fe2b2f2ab5b61d2eb2f	2011-07-22 09:28:06 -04:00
Johann	52d13777da	Merge "Add .size directive to ARM asm functions."	2011-07-21 12:56:59 -07:00
Johann	ddcdbfd71e	Merge "Mark ARM asm objects as allowing a non-executable stack."	2011-07-21 12:20:00 -07:00
Timothy B. Terriberry	1647f00c29	Add .size directive to ARM asm functions. This makes them show up properly in debugging tools like gdb and valgrind. Change-Id: I0c72548a1090de88ba226314e5efe63360b7e07f	2011-07-21 11:46:14 -07:00
Timothy B. Terriberry	0453aca5af	Mark ARM asm objects as allowing a non-executable stack. This adds the magic .note.GNU-stack section at the end of each ARM asm file (when built with gas), indicating that a non-executable stack is allowed. Without this section, the linker will assume the object requires an executable stack by default, forcing an executable stack for the entire program. Change-Id: Ie86de6a449b52d392b9e5e0479833ed8c508ee65	2011-07-21 11:45:00 -07:00
John Koleszar	2bdda84e37	Merge "Increase chrow row alignment to 16 bytes."	2011-07-21 07:32:39 -07:00
Yunqing Wang	c5fe641179	Merge "Add improvements made in good-quality mode to real-time mode"	2011-07-21 07:27:09 -07:00
Timothy B. Terriberry	7d1b37cdac	Increase chrow row alignment to 16 bytes. This is done by expanding luma row to 32-byte alignment, since there is currently a bunch of code that assumes that uv_stride == y_stride/2 (see, for example, vp8/common/postproc.c, common/reconinter.c, common/arm/neon/recon16x16mb_neon.asm, encoder/temporal_filter.c, and possibly others; I haven't done a full audit). It also uses replaces the hardcoded border of 16 in a number of encoder buffers with VP8BORDERINPIXELS (currently 32), as the chroma rows start at an offset of border/2. Together, these two changes have the nice advantage that simply dumping the frame memory as a contiguous blob produces a valid, if padded, image. Change-Id: Iaf5ea722ae5c82d5daa50f6e2dade9de753f1003	2011-07-20 10:20:31 -07:00
Attila Nagy	0afcc76971	encoder: don't set the fragment bit for the last partition Change-Id: Icb4e4f0d7c3074a8507852178be87541a1cb5bac	2011-07-20 14:09:42 +03:00
Scott LaVarnway	b2d9700f53	Merge "Moved vp8_encode_bool into boolhuff.h"	2011-07-19 08:15:14 -07:00
John Koleszar	d98a5ed4dd	Revert "Disable __longjmp_chk protection" This reverts commit `b73a3693e5`. This version of the check doesn't work with generic-gnu, and figuring out the correct symbol version at configure time is probably more work than this is worth. May revisit in the future. Change-Id: I6c75e88bd3bd82a4b21e09a25780fe53aacb7d70	2011-07-19 10:00:27 -04:00
Johann	6afafc313c	remove old armv5 code armv5 dequantizer is not referenced Change-Id: Id1cc617dcee35ebd6a406816ec6aaa26e8bbc8ad	2011-07-19 09:20:38 -04:00
Scott LaVarnway	a25f6a9c88	Moved vp8_encode_bool into boolhuff.h allowing the compiler to inline this function. For real-time encodes, this gave a boost of 1% to 2.5%, depending on the speed setting. Change-Id: I3929d176cca086b4261267b848419d5bcff21c02	2011-07-19 09:17:25 -04:00
John Koleszar	b5ea2fbc2c	Improved 1-pass CBR rate control This patch attempts to improve the handling of CBR streams with respect to the short term buffering requirements. The "buffer level" is changed to be an average over the rc buffer, rather than a long running average. Overshoot is also tracked over the same interval and the golden frame targets suppressed accordingly to correct for overly aggressive boosting. Testing shows that this is fairly consistently positive in one metric or another -- some clips that show significant decreases in quality have better buffering characteristics, others show improvenents in both. Change-Id: I924c89aa9bdb210271f2e03311e63de3f1f8f920	2011-07-18 11:48:05 -04:00
John Koleszar	74ad25a4c6	Merge "Disable __longjmp_chk protection"	2011-07-18 08:43:59 -07:00
John Koleszar	da39e505dd	Merge "Fixed rate histogram calculation"	2011-07-18 06:07:51 -07:00
Tero Rintaluoma	fd41cb8491	Fixed rate histogram calculation Using small values for --buf-sz= in command line causes floating point exception due to division by zero. Change-Id: Ibfe2d44db922993a78ebc9a4a1087d9625de48ae	2011-07-18 10:35:05 +03:00
Scott LaVarnway	e68894fa03	Merge "Tokenize MB optimized"	2011-07-15 07:54:14 -07:00
Yunqing Wang	f676171e52	Merge "Fix vpxenc encoding incorrect webm file header on big endian machines(Issue 331)"	2011-07-15 05:21:35 -07:00
Tero Rintaluoma	4e82f01547	Tokenize MB optimized Optimized C-code of the following functions: - vp8_tokenize_mb - tokenize1st_order_b - tokenize2nd_order_b Gives ~1-5% speed-up for RT encoding on Cortex-A8/A9 depending on encoding parameters. Change-Id: I6be86104a589a06dcbc9ed3318e8bf264ef4176c	2011-07-15 11:26:54 +03:00
James Berry	6b6f367c3d	bug fix vpx_copy_and_extend_frame size issue vpx_copy_and_extend_frame could incorrectly resize uv frames which could result in a crash. Change-Id: Ie96f7078b1e328b3907a06eebeee44ca39a2e898	2011-07-14 15:58:15 -04:00
John Koleszar	04dce631a2	Remove unused speed features min_fs_radius, max_fs_radius, full_freq were set but never read. Change-Id: I82657f4e7f2ba2acc3cbc3faa5ec0de5b9c6ec74	2011-07-14 14:20:25 -04:00
Fritz Koenig	4ab3175b12	Merge "Better allocate yuv buffers."	2011-07-13 14:18:11 -07:00
Yunqing Wang	f1f28535c3	Merge "Fix unnecessary casting of B_PREDICTION_MODE (issue 349)"	2011-07-13 13:32:57 -07:00
John Koleszar	b73a3693e5	Disable __longjmp_chk protection glibc implements some checking on longjmp() calls by replacing it with an internal function __longjmp_chk(), when FORTIFY_SOURCE is defined. This can be problematic when compiling the library under one version of glibc and running it under another. Work around this issue for the one symbol affected for now, before taking out the undef hammer. Fixes http://code.google.com/p/webm/issues/detail?id=166 Change-Id: Ifc5e25cdec17915e394711f2185b3e9214572d10	2011-07-13 16:07:00 -04:00
Yunqing Wang	139577f937	Fix unnecessary casting of B_PREDICTION_MODE (issue 349) Minor fix. Change-Id: Iaf93f6e47e882a33c479e57c7a0d0bf321e291c0	2011-07-13 15:52:07 -04:00
Yunqing Wang	0e9a6ed72a	Add improvements made in good-quality mode to real-time mode Several improvements we made in good-quality mode can be added into real-time mode to speed up encoding in speed 1, 2, and 3 with small quality loss. Tests using tulip clip showed: --rt --cpu-used=-1 (before change) PSNR: 38.028 time: 1m33.195s (after change) PSNR: 38.014 time: 1m20.851s --rt --cpu-used=-2 (before change) PSNR: 37.773 time: 0m57.650s (after change) PSNR: 37.759 time: 0m54.594s --rt --cpu-used=-3 (before change) PSNR: 37.392 time: 0m42.865s (after change) PSNR: 37.375 time: 0m41.949s Change-Id: I76ab2a38d72bc5efc91f6fe20d332c472f6510c9	2011-07-13 14:51:02 -04:00
Fritz Koenig	e9751d4b74	Better allocate yuv buffers. Previously allocated more memory than necessary for yuv buffers. This makes it harder to track bugs with reading uninitialized data. Change-Id: I510f7b298d3c647c869be6e5d51608becc63cce9	2011-07-13 10:37:15 -07:00
Fritz Koenig	84c3cd79d1	Merge "Reduce motion vector search on alt-ref frame."	2011-07-13 10:07:30 -07:00
John Koleszar	7f0b11c0ae	Merge "Remove rotting NDS_NITRO code."	2011-07-13 05:46:30 -07:00
Johann	211694f67e	Merge "update x86 asm for loopfilter"	2011-07-13 04:10:03 -07:00

1 2 3 4 5 ...

1231 Коммитов Все ветки Поиск

1231 Коммитов

Все ветки