This repository was archived by the owner on Feb 5, 2019. It is now read-only.

update jemalloc to master #4

Closed
wants to merge 100 commits

Conversation

thestinger

No description provided.

thestinger and others added 30 commits May 7, 2014 18:48
By default, git will coerce LF to CRLF when files are checked out on
Windows. This causes hard to diagnose errors when compiling with
mingw-w64 from Windows rather than cross-compiling.
fix git handling of newlines on windows
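A .gitattributes entry is the usual fix for this; a minimal sketch, assuming the goal is to force LF in the working tree (the exact attributes in the actual commit may differ):

```gitattributes
# Force LF line endings on checkout for all files, overriding
# core.autocrlf, so builds with mingw-w64 on Windows see the same
# bytes as on other platforms.
* text eol=lf
```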
Add new mallctl endpoints "arena<i>.chunk.alloc" and
"arena<i>.chunk.dealloc" to allow userspace to configure
jemalloc's chunk allocator and deallocator on a per-arena
basis.
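As a rough sketch of how such endpoints would be used: the hook signatures and exact mallctl names below are assumptions based on this era of the API, not taken verbatim from the commit.

```c
#include <stdbool.h>
#include <stddef.h>
#include <jemalloc/jemalloc.h>

/* Assumed hook signatures for this era of the API. */
typedef void *(chunk_alloc_t)(size_t size, size_t alignment, bool *zero,
    unsigned arena_ind);
typedef bool (chunk_dealloc_t)(void *chunk, size_t size, unsigned arena_ind);

static void *
my_chunk_alloc(size_t size, size_t alignment, bool *zero, unsigned arena_ind)
{
	/* Hand out 'size' bytes at 'alignment' from a custom source
	 * (e.g. a reserved mapping); NULL indicates failure. */
	return (NULL);
}

static bool
my_chunk_dealloc(void *chunk, size_t size, unsigned arena_ind)
{
	/* Return the chunk to the custom source; true indicates failure. */
	return (true);
}

static int
install_chunk_hooks(void)
{
	chunk_alloc_t *alloc_hook = my_chunk_alloc;
	chunk_dealloc_t *dealloc_hook = my_chunk_dealloc;

	/* Install both hooks for arena 0. */
	if (mallctl("arena.0.chunk.alloc", NULL, NULL, &alloc_hook,
	    sizeof(alloc_hook)) != 0)
		return (1);
	if (mallctl("arena.0.chunk.dealloc", NULL, NULL, &dealloc_hook,
	    sizeof(dealloc_hook)) != 0)
		return (1);
	return (0);
}
```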
Refactor huge allocation to be managed by arenas (though the global
red-black tree of huge allocations remains for lookup during
deallocation).  This is the logical conclusion of recent changes that 1)
made per arena dss precedence apply to huge allocation, and 2) made it
possible to replace the per arena chunk allocation/deallocation
functions.

Remove the top level huge stats, and replace them with per arena huge
stats.

Normalize function names and types to *dalloc* (some were *dealloc*).

Remove the --enable-mremap option.  As jemalloc currently operates, this
is a performance regression for some applications, but planned work to
logarithmically space huge size classes should provide similar amortized
performance.  The motivation for this change was that mremap-based huge
reallocation forced leaky abstractions that prevented refactoring.
test/integration/aligned_alloc.c needs it.
Set `STATIC_PAGE_SHIFT` to 12 when cross-compiling jemalloc. A shift of
12 corresponds to a page size of 4 KiB on practically all platforms.
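The arithmetic is a power of two; a one-line illustration (the PAGE macro name here is illustrative, not necessarily jemalloc's):

```c
#define STATIC_PAGE_SHIFT 12
/* 1 << 12 == 4096 bytes, i.e. a 4 KiB page. */
#define PAGE ((size_t)1 << STATIC_PAGE_SHIFT)
```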
Use nallocx() rather than mallctl() to trigger initialization, because
nallocx() has no side effects other than initialization, whereas
mallctl() does a bunch of internal memory allocation.
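A minimal sketch of the idea, using only the public API:

```c
#include <jemalloc/jemalloc.h>

/* Force initialization without allocating: nallocx() merely computes
 * the real size that mallocx(size, flags) would return. */
static void
trigger_init(void)
{
	(void)nallocx(1, 0);
}
```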
Add size class computation capability, currently used only as validation
of the size class lookup tables.  Generalize the size class spacing used
for bins, for eventual use throughout the full range of allocation
sizes.
Fix KZI() and KQI() to append LL rather than ULL.
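The suffix matters because ULL makes a literal unsigned, which silently turns signed comparisons into unsigned ones. A hedged illustration; these token-pasting macros are hypothetical stand-ins, not jemalloc's actual KZI()/KQI():

```c
/* Hypothetical stand-ins for literal-suffix macros. */
#define LIT_ULL(n)	n##ULL
#define LIT_LL(n)	n##LL

/* (-1 < LIT_ULL(4096)) is false: -1 converts to ULLONG_MAX.
 * (-1 < LIT_LL(4096)) is true, as intended for signed arithmetic. */
```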
Jason Evans and others added 26 commits September 4, 2014 22:27
Optimize [nmd]alloc() fast paths such that the (flags == 0) case is
streamlined, flags decoding only happens to the minimum degree
necessary, and no conditionals are repeated.
Move typedefs from jemalloc_protos.h.in to jemalloc_typedefs.h.in, so
that typedefs aren't redefined when compiling stress tests.
Building against glibc 2.19 hits a compilation error unless the type is renamed.
avoid conflict with the POSIX timer_t type
This adds a new `sdallocx` function to the external API, allowing the
size to be passed by the caller.  It avoids some extra reads in the
thread cache fast path.  In the case where stats are enabled, this
avoids the work of calculating the size from the pointer.

An assertion validates the size that's passed in, so enabling debugging
will allow users of the API to debug cases where an incorrect size is
passed in.

The performance win for a contrived microbenchmark doing an allocation
and immediately freeing it is ~10%.  It may have a different impact on a
real workload.

Closes jemalloc#28
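A minimal usage sketch, using only the public API described above:

```c
#include <jemalloc/jemalloc.h>

static void
example(void)
{
	void *p = mallocx(42, 0);

	if (p != NULL) {
		/* Pass the known size back so jemalloc can skip reading
		 * it from the pointer's metadata. */
		sdallocx(p, 42, 0);
	}
}
```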
fix isqalloct (should call isdalloct)
- Add a --thread N option to select the profile for thread N (otherwise, all threads are printed)
- The $profile map now has a {threads} element that maps each thread id to a profile with the same format as the {profile} element
- Refactor ReadHeapProfile into smaller components and use them to implement ReadThreadedHeapProfile
Refactor sdallocx() and nallocx() to share inallocx(), and fix an
sdallocx() assertion to check usize rather than size.
Fix ReadThreadedHeapProfile to pass the correct parameters to
AdjustSamples.
Fix prof_tdata_get() to avoid dereferencing an invalid tdata pointer
(when it's PROF_TDATA_STATE_{REINCARNATED,PURGATORY}).

Fix prof_tdata_get() callers to check for invalid results besides NULL
(PROF_TDATA_STATE_{REINCARNATED,PURGATORY}).

These regressions were caused by
602c8e0 (Implement per thread heap
profiling.), which did not make it into any releases prior to these
fixes.
Fix a profile sampling race that was due to preparing to sample, yet
doing nothing to assure that the context remains valid until the stats
are updated.

These regressions were caused by
602c8e0 (Implement per thread heap
profiling.), which did not make it into any releases prior to these
fixes.
Mark the following conditions as unlikely:

* assertion failure
* malloc_init failure
* malloc not already initialized (in malloc_init)
* running in valgrind
* thread cache disabled at runtime

Clang and GCC already consider a comparison with NULL or -1 to be cold,
so many branches (e.g. out-of-memory checks) are already correctly
treated as cold, and marking them explicitly is not important.
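A sketch of the typical annotation on GCC/Clang; the macro name and the initialization flag below are illustrative, not jemalloc's internals:

```c
#include <stdbool.h>

#if defined(__GNUC__) || defined(__clang__)
#  define unlikely(x)	__builtin_expect(!!(x), 0)
#else
#  define unlikely(x)	(x)
#endif

static bool malloc_initialized;	/* illustrative flag */

static void
ensure_initialized(void)
{
	/* The compiler lays out the cold path out of line. */
	if (unlikely(!malloc_initialized)) {
		/* one-time initialization */
		malloc_initialized = true;
	}
}
```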
Fix irallocx_prof() sample logic to only update the threshold counter
after it knows what size the allocation ended up being.  This regression
was caused by 6e73dc1 (Fix a profile
sampling race.), which did not make it into any releases prior to this
fix.
Don't use atomic_add_uint64(), because it isn't available on 32-bit
platforms.

Fix forking support functions to manage all prof-related mutexes.

These regressions were introduced by
602c8e0 (Implement per thread heap
profiling.), which did not make it into any releases prior to these
fixes.

@alexcrichton mentioned this pull request Sep 12, 2014
@thestinger deleted the rust-2014-09-12-do-not-delete branch October 2, 2014 04:52