- 01 Dec, 2020 8 commits
-
-
Ondřej Nový authored
-
Ondřej Nový authored
-
Ondřej Nový authored
-
Ondřej Nový authored
-
Ondřej Nový authored
-
Ondřej Nový authored
-
Ondřej Nový authored
-
Ondřej Nový authored
ISA-L 2.30 release
-
- 07 Nov, 2020 1 commit
-
-
Greg Tucker authored
Change-Id: If6d696ee76f3949d3cf5aff34403df65bce2c6b9 Signed-off-by:Greg Tucker <greg.b.tucker@intel.com>
-
- 04 Nov, 2020 3 commits
-
-
Greg Tucker authored
Change-Id: Icbb1faa2b67d8d18b1c7cde9f09774ebd895a6df Signed-off-by:Greg Tucker <greg.b.tucker@intel.com>
-
Greg Tucker authored
Change-Id: I58401ed26ba8a0a7fad0191b4c1bbb461d0311e6 Signed-off-by:Greg Tucker <greg.b.tucker@intel.com>
-
Greg Tucker authored
Clang with sanitizer on was catching on cast of static header. Switching to uload64 macro for better general solution. Change-Id: I495d440407bb1773841e2f7cdc48bd95fc1a2df4 Signed-off-by:Greg Tucker <greg.b.tucker@intel.com>
-
- 03 Nov, 2020 1 commit
-
-
Greg Tucker authored
In the newly added function isal_deflate_process_dict(), a null check was added to the dictionary struct but was ineffectual because of the order. Change-Id: I3b3e70997210794de102b1348e1467295871cee2 Signed-off-by:Greg Tucker <greg.b.tucker@intel.com>
-
- 29 Oct, 2020 2 commits
-
-
Greg Tucker authored
Change-Id: I83e62344fab72afd755453d4eb43e9c236ba2b86 Signed-off-by:Greg Tucker <greg.b.tucker@intel.com>
-
Greg Tucker authored
Change-Id: I0b0a151374acfe9b44c7a2be4bb959df59356d97 Signed-off-by:Greg Tucker <greg.b.tucker@intel.com>
-
- 22 Oct, 2020 6 commits
-
-
Greg Tucker authored
Change-Id: Id55728fea286d144f8a11192ab02ccc8503d7b25 Signed-off-by:Greg Tucker <greg.b.tucker@intel.com>
-
Greg Tucker authored
Change-Id: I515217b19373b8f996ff887268862cf2b102f3a4 Signed-off-by:Greg Tucker <greg.b.tucker@intel.com>
-
Greg Tucker authored
Change-Id: Ie2edf8e742d0bcdd9a008704f997006f8f5009ac Signed-off-by:Greg Tucker <greg.b.tucker@intel.com>
-
Greg Tucker authored
Change-Id: I4eed03bdb91030b16b3ecfd8076adc890e4f59a2 Signed-off-by:Greg Tucker <greg.b.tucker@intel.com>
-
Greg Tucker authored
Change-Id: I4b0dac1e5aa2796be17644b893e3b6c7aed05876 Signed-off-by:Greg Tucker <greg.b.tucker@intel.com>
-
Greg Tucker authored
Performance improvement for inflate to skip the time-consuming process of decode table expansion when the header matches a known common dymanic one such as produced by level 0 compression. Change-Id: Ia2550b812a062b7cc2eb1b72bcb609f1a631e40b Signed-off-by:Greg Tucker <greg.b.tucker@intel.com>
-
- 27 Aug, 2020 1 commit
-
-
Greg Tucker authored
Change-Id: I36c3616163f6fec61dda9cf8b35ca561e59477c9 Signed-off-by:Greg Tucker <greg.b.tucker@intel.com>
-
- 26 Aug, 2020 2 commits
-
-
Greg Tucker authored
Change-Id: I1180ba749d54e7ef433b01b33450e52ac5dbb2bb Signed-off-by:Greg Tucker <greg.b.tucker@intel.com>
-
Greg Tucker authored
Previous new crc version missed the update for nmake. Change-Id: Ie529ee9d70d8d0ab8a8af3bd2720405802180d1e Signed-off-by:Greg Tucker <greg.b.tucker@intel.com>
-
- 22 Aug, 2020 1 commit
-
-
Greg Tucker authored
Change-Id: I1c509c6ea312b6eb4e1c2c1c8bb7044f7b043e0d Signed-off-by:Greg Tucker <greg.b.tucker@intel.com>
-
- 13 Aug, 2020 2 commits
-
-
Greg Tucker authored
Change-Id: I2beade6682e78fda30a18228a8660201ae7bf718 Signed-off-by:Greg Tucker <greg.b.tucker@intel.com>
-
Greg Tucker authored
Issue with reading header only appears when combined with new feature in cli of multiple concatenated gzip files. Change-Id: Id8df9150c6f27d8b22e810b511291f3fcf136723 Signed-off-by:Greg Tucker <greg.b.tucker@intel.com>
-
- 27 Jul, 2020 1 commit
-
-
Ruben Vorderman authored
This will make it easier for users to get the latest version. Installing with conda is easier than compiling it yourself. Distro packages (such as Debian's) do not always ship the latest version while conda-forge can. This badge will advertise this install method. Change-Id: I99a1853a00e55fdf0c574c9906675738ac278121 Signed-off-by:Ruben Vorderman <r.h.p.vorderman@lumc.nl>
-
- 09 Jul, 2020 1 commit
-
-
Jerry Yu authored
Tweak performances with prefetch instructions. Below is the test results: - Neoverse N1: ~30% - Cortex-A72: ~3% - Cortex-A57: ~90% - Others: 50% - 5x Change-Id: I3ab292a953043dbaea98af3c66778f57da3a1331 Signed-off-by:Jerry Yu <jerry.h.yu@arm.com>
-
- 03 Jun, 2020 1 commit
-
-
Jerry Yu authored
Arm64 and ppc64 build reports below error: "configure: error: conditional "INTEL_CET_ENABLED" was never defined." And the error should be report in all non-x86 platform. Change-Id: I4c1b2fc99091424cfd5c62cf4d6536222b66712d Signed-off-by:Jerry Yu <jerry.h.yu@arm.com>
-
- 27 May, 2020 1 commit
-
-
H.J. Lu authored
We should generate .note.gnu.property section with x86 assembly codes for ELF outputs to mark Intel CET support when Intel CET is enabled since all input files must be marked with Intel CET support in order for linker to mark output with Intel CET support. Since nasm and yasm can't generate the proper .note.gnu.property section, yasm-cet-filter.sh and yasm-filter.sh are added to generate the proper .note.gnu.property with linker help. Verified with $ CC="gcc -Wl,-z,cet-report=error -fcf-protection" CXX="g++ -Wl,-z,cet-report=error -fcf-protection" .../configure x86_64-linux $ make -j8 on Linux/x86-64. Change-Id: I14e03a8a9031c8397dc36939a528cf5a827d775a Signed-off-by:H.J. Lu <hjl.tools@gmail.com>
-
- 26 May, 2020 1 commit
-
-
H.J. Lu authored
To support Intel CET, all indirect branch targets must start with ENDBR32/ENDBR64. Here is a patch to define endbranch and add it to function entries in x86 assembly codes which are indirect branch targets as discovered by running testsuite on Intel CET machine and visual inspection. Verified with $ CC="gcc -Wl,-z,cet-report=error -fcf-protection" CXX="g++ -Wl,-z,cet-report=error -fcf-protection" .../configure x86_64-linux $ make -j8 $ make -j8 check with both nasm and yasm on both CET and non-CET machines. Change-Id: I9822578e7294fb5043a64ab7de5c41de81a7d337 Signed-off-by:H.J. Lu <hjl.tools@gmail.com>
-
- 09 May, 2020 1 commit
-
-
Zhiyuan Zhu authored
Change-Id: Ib3bf04215cca491db522ec33905fe48df173cc2f Signed-off-by:Zhiyuan Zhu <zhiyuan.zhu@arm.com>
-
- 16 Apr, 2020 1 commit
-
-
Jerry Yu authored
To reduce the cache missing events, the mix layout is changed to PMULL+CRC. It also relaxes the final delay caused by data dependency. As results, the cold perf was improved about 20% and warm perf was improved about 4%. Change-Id: I7756f846edcb4f1665b4643a5a0e02283938cfdf Signed-off-by:Jerry Yu <jerry.h.yu@arm.com>
-
- 10 Apr, 2020 1 commit
-
-
Jerry Yu authored
Change-Id: I7c8a2348441f32a43ff386122612405e418d9947 Signed-off-by:Jerry Yu <jerry.h.yu@arm.com>
-
- 08 Apr, 2020 2 commits
-
-
Jerry Yu authored
Hardware folding algorithm depend on CRC32 and PMULL instruction. And it should match both flags . Change-Id: I361068402db1fe6d7c0bd8d2c7048f1d94880233 Signed-off-by:Jerry Yu <jerry.h.yu@arm.com>
-
Jerry Yu authored
Change-Id: Ib1658fd4b87b31d8ea6c93f697b50d9b409c186e Signed-off-by:Jerry Yu <jerry.h.yu@arm.com>
-
- 31 Mar, 2020 1 commit
-
-
Greg Tucker authored
The nmake default is changed for a modern nasm. Older nasm and yasm versions will still work with windows but the nmake options must be changed appropriately for max AS_FEATURE_LEVEL to match. Also now generates debug symbol pdb files. Change-Id: I94a2dd7ecf541c6564ccbd4a184c33995d7b31ad Signed-off-by:
Poornima Kumar <poornima.kumar@intel.com> Signed-off-by:
Greg Tucker <greg.b.tucker@intel.com>
-
- 30 Mar, 2020 2 commits
-
-
Jerry Yu authored
This patch is base on reference(1) algorithm with some changes. - Redefine the block number to two. - That's due to only two pipe-line can be used in CRC32 calculate. - Redefine the block size: - The block size of CRC is 1536B and PMULL is 512B - Interleave CRC and PMULL instructions. The optimization parameters are calculated base on reference(2) References: - https://www.intel.com/content/dam/www/public/us/en/documents/white-papers/fast-crc-computation-generic-polynomials-pclmulqdq-paper.pdf - https://developer.arm.com/docs/swog309707/a Change-Id: I1c9e593d59b521f56e4b3c807b396c083c181636 Signed-off-by:
Jerry Yu <jerry.h.yu@arm.com>
-
Jerry Yu authored
This patch provides microarchitecture information and make microarchitecture optimization possible. It will trap into kernel due to mrs instruction. So it should be called only in dispatcher, that will be called only once in program lifecycle. And HWCAP must be match,That will make sure there are no illegal instruction errors. Change-Id: I393ec742010bf3f10ce335482c0350aa4202c788 Signed-off-by:Jerry Yu <jerry.h.yu@arm.com>
-