WSL2-Linux-Kernel

История

Dave Hansen 7d06d9c9bd mm: Implement new pkey_mprotect() system call pkey_mprotect() is just like mprotect, except it also takes a protection key as an argument. On systems that do not support protection keys, it still works, but requires that key=0. Otherwise it does exactly what mprotect does. I expect it to get used like this, if you want to guarantee that any mapping you create can never be accessed without the right protection keys set up. int real_prot = PROT_READ\|PROT_WRITE; pkey = pkey_alloc(0, PKEY_DENY_ACCESS); ptr = mmap(NULL, PAGE_SIZE, PROT_NONE, MAP_ANONYMOUS\|MAP_PRIVATE, -1, 0); ret = pkey_mprotect(ptr, PAGE_SIZE, real_prot, pkey); This way, there is no window where the mapping is accessible since it was always either PROT_NONE or had a protection key set that denied all access. We settled on 'unsigned long' for the type of the key here. We only need 4 bits on x86 today, but I figured that other architectures might need some more space. Semantically, we have a bit of a problem if we combine this syscall with our previously-introduced execute-only support: What do we do when we mix execute-only pkey use with pkey_mprotect() use? For instance: pkey_mprotect(ptr, PAGE_SIZE, PROT_WRITE, 6); // set pkey=6 mprotect(ptr, PAGE_SIZE, PROT_EXEC); // set pkey=X_ONLY_PKEY? mprotect(ptr, PAGE_SIZE, PROT_WRITE); // is pkey=6 again? To solve that, we make the plain-mprotect()-initiated execute-only support only apply to VMAs that have the default protection key (0) set on them. Proposed semantics: 1. protection key 0 is special and represents the default, "unassigned" protection key. It is always allocated. 2. mprotect() never affects a mapping's pkey_mprotect()-assigned protection key. A protection key of 0 (even if set explicitly) represents an unassigned protection key. 2a. mprotect(PROT_EXEC) on a mapping with an assigned protection key may or may not result in a mapping with execute-only properties. pkey_mprotect() plus pkey_set() on all threads should be used to _guarantee_ execute-only semantics if this is not a strong enough semantic. 3. mprotect(PROT_EXEC) may result in an "execute-only" mapping. The kernel will internally attempt to allocate and dedicate a protection key for the purpose of execute-only mappings. This may not be possible in cases where there are no free protection keys available. It can also happen, of course, in situations where there is no hardware support for protection keys. Signed-off-by: Dave Hansen <dave.hansen@linux.intel.com> Acked-by: Mel Gorman <mgorman@techsingularity.net> Cc: linux-arch@vger.kernel.org Cc: Dave Hansen <dave@sr71.net> Cc: arnd@arndb.de Cc: linux-api@vger.kernel.org Cc: linux-mm@kvack.org Cc: luto@kernel.org Cc: akpm@linux-foundation.org Cc: torvalds@linux-foundation.org Link: http://lkml.kernel.org/r/20160729163012.3DDD36C4@viggo.jf.intel.com Signed-off-by: Thomas Gleixner <tglx@linutronix.de>		2016-09-09 13:02:26 +02:00
..
kasan	kasan: remove the unnecessary WARN_ONCE from quarantine.c	2016-08-11 16:58:14 -07:00
Kconfig	mm: clarify COMPACTION Kconfig text	2016-08-26 17:39:35 -07:00
Kconfig.debug	…
Makefile	…
backing-dev.c	…
balloon_compaction.c	…
bootmem.c	…
cleancache.c	…
cma.c	…
cma.h	…
cma_debug.c	…
compaction.c	…
debug.c	…
debug_page_ref.c	…
dmapool.c	…
early_ioremap.c	…
fadvise.c	…
failslab.c	…
filemap.c	…
frame_vector.c	…
frontswap.c	…
gup.c	…
highmem.c	…
huge_memory.c	soft_dirty: fix soft_dirty during THP split	2016-08-26 17:39:35 -07:00
hugetlb.c	mm/hugetlb: fix incorrect hugepages count during mem hotplug	2016-08-11 16:58:13 -07:00
hugetlb_cgroup.c	…
hwpoison-inject.c	…
init-mm.c	…
internal.h	…
interval_tree.c	…
khugepaged.c	…
kmemcheck.c	…
kmemleak-test.c	…
kmemleak.c	…
ksm.c	…
list_lru.c	…
maccess.c	…
madvise.c	…
memblock.c	…
memcontrol.c	mm: memcontrol: avoid unused function warning	2016-08-26 17:39:35 -07:00
memory-failure.c	…
memory.c	…
memory_hotplug.c	mm/memory_hotplug.c: initialize per_cpu_nodestats for hotadded pgdats	2016-08-11 16:58:14 -07:00
mempolicy.c	mm, mempolicy: task->mempolicy must be NULL before dropping final reference	2016-09-01 17:52:01 -07:00
mempool.c	…
memtest.c	…
migrate.c	…
mincore.c	…
mlock.c	…
mm_init.c	…
mmap.c	…
mmu_context.c	…
mmu_notifier.c	…
mmzone.c	…
mprotect.c	mm: Implement new pkey_mprotect() system call	2016-09-09 13:02:26 +02:00
mremap.c	…
msync.c	…
nobootmem.c	…
nommu.c	…
oom_kill.c	mm, oom: fix uninitialized ret in task_will_free_mem()	2016-08-11 16:58:14 -07:00
page-writeback.c	…
page_alloc.c	mm, vmscan: only allocate and reclaim from zones with pages managed by the buddy allocator	2016-09-01 17:52:01 -07:00
page_counter.c	…
page_ext.c	…
page_idle.c	…
page_io.c	…
page_isolation.c	…
page_owner.c	…
page_poison.c	…
pagewalk.c	…
percpu-km.c	…
percpu-vm.c	…
percpu.c	…
pgtable-generic.c	…
process_vm_access.c	…
quicklist.c	…
readahead.c	mm: silently skip readahead for DAX inodes	2016-08-26 17:39:35 -07:00
rmap.c	rmap: fix compound check logic in page_remove_file_rmap	2016-08-10 16:40:56 -07:00
shmem.c	…
slab.c	…
slab.h	…
slab_common.c	…
slob.c	…
slub.c	mm/slub.c: run free_partial() outside of the kmem_cache_node->list_lock	2016-08-10 16:40:56 -07:00
sparse-vmemmap.c	…
sparse.c	…
swap.c	…
swap_cgroup.c	…
swap_state.c	…
swapfile.c	…
truncate.c	…
usercopy.c	usercopy: fix overlap check for kernel text	2016-08-22 19:10:51 -07:00
userfaultfd.c	…
util.c	…
vmacache.c	…
vmalloc.c	…
vmpressure.c	…
vmscan.c	mm, vmscan: only allocate and reclaim from zones with pages managed by the buddy allocator	2016-09-01 17:52:01 -07:00
vmstat.c	…
workingset.c	…
z3fold.c	…
zbud.c	…
zpool.c	…
zsmalloc.c	…
zswap.c	…