Liu Song’s Projects

Hash	Date	Commit message	Author
ae44e23e	2023-03-22 07:47:15	When seed <= 0 - use the clock to generate one	Georgi Gerganov
928480ef	2023-03-22 07:45:00	Init llama_context_params properly from CLI (#370)	Georgi Gerganov
56817b1f	2023-03-22 07:34:02	Remove temporary notice and update hot topics	Georgi Gerganov
f5a77a62	2023-03-22 07:32:36	Introduce C-style API (#370)	Georgi Gerganov
da0e9fe9	2023-03-20 20:14:06	Add SHA256SUMS file and instructions to README how to obtain and verify the downloads	Gary Mulder
e6c9e098	2023-03-21 23:49:24	Fix bin dir for win ci	anzz1
01a297b0	2023-03-21 22:34:25	specify build type for ctest on windows (#371)	Erik Scholz
3366853e	2023-03-21 22:57:35	Add notice about pending change	Georgi Gerganov
3f9c6135	2023-03-21 16:52:27	fix typo in chatLLaMa (#368)	Mathieu Nayrolles
0f613527	2023-03-21 19:47:27	Update issue templates	Georgi Gerganov
353ec251	2023-03-21 14:21:50	We could use std::unordered_map over std::map (#305)	Fabio R. Sluzala
89d5d90f	2023-03-21 18:11:01	Fix color codes emitting mid-UTF8 code. (#312)	Matvey Soloviev
16ffc013	2023-03-21 09:42:25	Importer for GPTQ quantized LLaMA models (#301)	comex
486ae645	2023-03-21 09:27:42	Compute perplexity over prompt (#270)	Gary Linscott
3ab3e658	2023-03-21 18:23:15	Add chatLLaMa script (#198)	Jean-Christophe Hoelt
f157088c	2023-03-21 11:21:06	makefile: Fix CPU feature detection on Haiku (#218)	Alex von Gluck IV
c86ba036	2023-03-21 18:14:46	Enable ANSI colors on Windows 10+ (#311)	anzz1
1daf4dd7	2023-03-21 18:10:32	Minor style changes	Georgi Gerganov
dc6a845b	2023-03-21 18:09:37	Add chat.sh script	Georgi Gerganov
6a612959	2023-03-21 17:05:06	Check for reverse prompt by characters instead of tokens (#292) (#330)	tjohnman
d5f56a5e	2023-03-21 17:04:43	Check for reverse prompt by characters instead of tokens (#292) (#330)	tjohnman
3bfa3b43	2023-03-21 17:59:16	Fix convert script, warnings alpaca instructions, default params	Georgi Gerganov
715d292e	2023-03-21 09:50:09	Add OpenBSD support (#314)	Kevin Lo
c98ae026	2023-03-21 08:49:43	fix typo in comment (#318)	Mack Straight
c3b2306b	2023-03-21 23:44:11	Makefile: slightly cleanup for Mac Intel; echo instead of run ./main -h (#335)	Qingyou Meng
975d2ceb	2023-03-21 17:42:43	cmdline option for custom amount of model parts (--n_parts N) (#348)	anzz1
e0ffc861	2023-03-21 08:34:49	Update IPFS links to quantized alpaca with new tokenizer format (#352)	Kevin Kwok
8f644a0a	2023-03-21 17:32:14	Change default repeat_penalty to 1.0	Georgi Gerganov
eb34620a	2023-03-21 17:29:41	Add tokenizer test + revert to C++11 (#355)	Georgi Gerganov
2e664f1f	2023-03-21 07:35:42	Add initial AVX512 support for dot product on Linux (#320)	Casey Primozic
8cf9f34e	2023-03-21 09:37:16	Adding missing features of CMakeLists.txt & Refactoring (#131)	nusu-github
bd4b46d6	2023-03-20 16:44:30	Nix flake: set meta.mainProgram to llama	Ben Siraphob
6b6d5b50	2023-03-21 03:33:10	Fixed tokenizer.model not found error when model dir is symlink (#325)	Qingyou Meng
a791a68b	2023-03-20 12:26:01	move file magic/version to header, print expected version (#319)	Mack Straight
0f1b21cb	2023-03-20 18:05:20	Docker - Fix publish docker image in GitHub Registry (#235)	Bernat Vadell
074bea2e	2023-03-20 03:17:23	sentencepiece bpe compatible tokenizer (#252)	Mack Straight
5cb63e24	2023-03-20 08:24:11	Add tqdm to Python requirements (#293)	Stephan Walter
da5303c1	2023-03-19 17:44:20	bugfix: default should not be interactive (#304)	cocktailpeanut
4545539d	2023-03-19 21:58:51	Rename script	Georgi Gerganov
edeba283	2023-03-19 21:57:28	Add temporary helper script for Alpaca chat	Georgi Gerganov
5c19c70b	2023-03-19 13:44:30	fix coloring of last `n_batch` of prompt, and refactor line input (#221)	Rickey Bowers Jr
24568371	2023-03-19 20:33:06	Support for multiple reverse prompts. (#299)	tjohnman
7392f1cd	2023-03-19 12:38:44	Improved quantize script (#222)	Suaj Carrot
ad5fd5b6	2023-03-19 19:36:19	Make prompt randomization optional. (#300)	tjohnman
368d0c8a	2023-03-19 19:31:17	Respect the maximum number of tokens in interactive. (#298)	tjohnman
50fae10d	2023-03-19 19:22:48	Add --ignore-eos parameter (#181)	slaren
084e2f0e	2023-03-20 02:10:00	interactive mode: print '\n' in sigint_handler, this flush stdout thus ensure color reset. (#283)	Qingyou Meng
0b366e73	2023-03-19 18:57:00	Command line switch to use F16 for memory_k and memory_v (refactor of #154) (#294)	Erik Scholz
160bfb21	2023-03-19 19:51:55	Update hot topics to mention Alpaca support	Georgi Gerganov
c494ed5b	2023-03-19 19:46:32	Fix off-by-one bug (#115)	Georgi Gerganov
c1c7026b	2023-03-19 19:33:18	Fix python stuff (#109)	Georgi Gerganov
467b1497	2023-03-19 20:17:39	Refactoring `convert-pth-to-ggml.py`: more concise and readable (#109)	qunash
70f01cb8	2023-03-19 19:04:44	Drop trailing new line from file prompts (#80)	Georgi Gerganov
a4e63b73	2023-03-19 18:49:50	Add instruction for using Alpaca (#240)	Georgi Gerganov
9e170721	2023-03-19 18:37:02	Add "--instruct" argument for usage with Alpaca (#240)	Georgi Gerganov
22213a17	2023-03-19 17:30:00	Change RMSNorm eps to 1e-6 (#173)	Georgi Gerganov
d7def1a7	2023-03-18 17:10:47	Warn user if a context size greater than 2048 tokens is specified (#274)	Ronsor
6f61c18e	2023-03-18 22:39:46	Fix typo in readme	Pavol Rusnak
1e5a6d08	2023-03-18 22:20:04	Add note about Python 3.11 to readme	Pavol Rusnak
554b5415	2023-03-18 21:58:46	Add memory/disk requirements to readme	Pavol Rusnak
d3f202d5	2023-03-18 20:51:49	Remove unused code since n_vocab is model.hparams.n_vocab (#262)	Alex Nguyen
e03e3597	2023-03-18 07:44:09	fixed warning with std::ignore about unused function result (#151)	Justin Suess
a81d0c2a	2023-03-18 04:17:19	Fix n^2 loop in tokenization (#254)	Gary Linscott
b2de7f18	2023-03-18 09:27:12	CI Improvements (#230)	anzz1
a2927478	2023-03-17 23:03:48	Nix flake (#40)	Niklas Korz
c9f670a1	2023-03-17 21:05:58	Implement non-greedy tokenizer that tries to maximize token lengths (#242)	thement
4f546091	2023-03-17 21:46:46	Default to 4 threads (#243)	Georgi Gerganov
e81b9c81	2023-03-17 20:30:04	Update Contributing section	Georgi Gerganov
367946c6	2023-03-17 17:47:35	Don't tell users to use a bad number of threads (#243)	Stephan Walter
6b0df5cc	2023-03-18 00:38:24	add ptread link to fix cmake build under linux (#114)	mmyjona
2af23d30	2023-03-17 10:47:06	🚀 Dockerize llamacpp (#132)	Bernat Vadell
904d2a8d	2023-03-17 05:48:39	Q4_1 quantization (#193)	Matvey Soloviev
72131107	2023-03-16 15:00:09	Update README.md	Georgi Gerganov
ac15de78	2023-03-16 08:55:13	Expand "Contributing" section	Georgi Gerganov
273abc47	2023-03-16 07:12:12	Update hot topics - RMSnorm	Georgi Gerganov
9b4a15b1	2023-03-15 19:29:25	Fix RMS norm in GGML (#191)	Nebula
6eac39ba	2023-03-15 18:41:38	Add RMS norm and use it (#187)	hoangmit
27944c42	2023-03-15 21:35:25	fixed typo (#178)	moritzbrantner
2d15d6c9	2023-03-15 13:56:24	add SIGINT support for _WIN32 environments (#120)	Rickey Bowers Jr
2d64715a	2023-03-15 15:42:40	added ctx_size parameter (#148)	Justin Suess
16b2c61a	2023-03-15 15:39:38	fixed color reset on exit (#149)	Justin Suess
977295c7	2023-03-15 22:39:06	Fix potential licensing issue (#126)	Musab Gultekin
956dfda8	2023-03-15 12:37:50	Use `tokenizer.vocab_size()` instead of hardcoding 32000 in convert-pth-to-ggml.py (#142)	Ronsor
113e685d	2023-03-15 15:05:14	inline -> static inline for "bytesFromNibbles" (#161)	hoangmit
47857e56	2023-03-14 12:34:37	Don't use vdotq_s32 if it's not available (#139)	Ronsor
60f819a2	2023-03-14 15:30:08	Add section to README on how to run the project on Android (#130)	Radoslav Gerganov
97ab2b25	2023-03-14 09:43:52	Add Misc section + update hot topics + minor fixes	Georgi Gerganov
2f700a27	2023-03-13 17:29:10	Add windows to the CI (#98)	Sebastián A
c09a9cfb	2023-03-13 21:22:15	CMake build in Release by default (#75)	Georgi Gerganov
7ec903d3	2023-03-13 19:21:51	Update contribution section, hot topics, limitations, etc.	Georgi Gerganov
4497ad81	2023-03-13 19:15:08	Print system information	Georgi Gerganov
ed6849cc	2023-03-13 14:12:33	Initial support for CMake (#75)	Sebastián A
41be0a3b	2023-03-13 17:40:54	Add NetBSD support. (#90)	Thomas Klausner
671d5cac	2023-03-13 17:39:56	Use fprintf for diagnostic output (#48)	Pavol Rusnak
84d9015c	2023-03-13 18:36:44	Use vdotq_s32 to improve performance (#67)	Georgi Gerganov
63fd76fb	2023-03-14 01:33:43	Reduce model loading time (#43)	uint256_t
2a20f48e	2023-03-13 12:24:18	Fix UTF-8 handling (including colors) (#79)	Val Kharitonov
d1f22471	2023-03-13 17:15:20	Add quantize script for batch quantization (#92)	Pavol Rusnak
1808ee05	2023-03-13 09:42:26	Add initial contribution guidelines	Georgi Gerganov
a169bb88	2023-03-13 04:08:01	Gate signal support on being on a unixoid system. (#74)	Matvey Soloviev
460c4825	2023-03-13 00:35:51	Fix token count accounting	Matvey Soloviev
c80e2a8f	2023-03-13 01:28:08	Revert "10% performance boost on ARM"	Georgi Gerganov
54a0e66e	2023-03-13 01:21:03	Check for vdotq_s32 availability	Georgi Gerganov
543c57e9	2023-03-13 01:05:24	Ammend to previous commit - forgot to update non-QRDMX branch	Georgi Gerganov
113a9e83	2023-03-13 00:56:10	10% performance boost on ARM	Georgi Gerganov
404fac0d	2023-03-12 23:07:34	Fix color getting reset before prompt output done (#65)	Matvey Soloviev
1a0a7430	2023-03-12 23:39:01	Update README.md	Georgi Gerganov
96ea727f	2023-03-12 22:13:28	Add interactive mode (#61)	Matvey Soloviev
96619548	2023-03-13 03:30:08	Fix typo in README (#45)	Marc Köhlbrugge
f385f8de	2023-03-12 13:28:36	Allow using prompt files (#59)	Ben Garney
02f0c6fe	2023-03-12 16:23:15	Add back top_k (#56)	beiller
eb062bb0	2023-03-12 17:15:00	Windows fixes (#31)	Sebastián A
7027a978	2023-03-12 22:09:26	Update README.md	Georgi Gerganov
2d555e5b	2023-03-12 22:08:24	Add CI (#60)	Georgi Gerganov
7c9e54e5	2023-03-12 20:59:01	Revert "weights_only" arg - this causing more trouble than help	Georgi Gerganov
b9bd1d01	2023-03-12 14:16:33	python/pytorch compat notes (#44)	Oleksandr Nikitin
129c7d1e	2023-03-12 05:27:42	Add repetition penalty (#20)	beiller
702fddf5	2023-03-12 09:03:25	Clarify meaning of hacking	Georgi Gerganov
7d86e25b	2023-03-12 08:41:54	README: add "Supported platforms" + update hot topics	Georgi Gerganov
a9312023	2023-03-11 22:36:35	use weights_only in conversion script (#32)	deepdiffuser
6a9a67f0	2023-03-12 07:36:03	Add LICENSE (#21)	Pavol Rusnak
da1a4ff0	2023-03-12 01:26:32	Update README.md	Georgi Gerganov
6b2cb630	2023-03-11 18:32:20	Fix a typo in model name (#16)	Juraj Bednar
4235e3d5	2023-03-11 18:10:18	Update README.md	Georgi Gerganov
f1eaff47	2023-03-11 17:58:18	Add AVX2 support for x86 architectures thanks to @Const-me !	Georgi Gerganov
a9e58529	2023-03-11 17:40:14	Fix un-initialized FP16 tables on x86 (#15, #2)	Georgi Gerganov
7d9ed7b2	2023-03-11 12:44:21	Bump memory buffer	Georgi Gerganov
0c680332	2023-03-11 12:31:21	Update README.md	Georgi Gerganov
f60fa9e5	2023-03-11 12:26:46	.gitignore models/	Georgi Gerganov
7211862c	2023-03-11 12:26:16	Update Makefile var + add comment	Georgi Gerganov
a5c5ae2f	2023-03-11 11:34:25	Update README.md	Georgi Gerganov
ea977e85	2023-03-11 11:34:11	Update README.md	Georgi Gerganov
007a8f6f	2023-03-11 10:47:09	Support all LLaMA models + change Q4_0 quantization storage	Georgi Gerganov
5f2f970d	2023-03-10 21:47:26	Include Python dependencies in README (#6)	Simon Willison
73c6ed5e	2023-03-11 01:30:47	Update README.md	Georgi Gerganov
01eeed8f	2023-03-11 01:22:58	Update README.md	Georgi Gerganov
6da2df34	2023-03-11 01:18:10	Update README.md	Georgi Gerganov
9dcf4dba	2023-03-10 18:04:06	Add missing headers for memcpy and assert (#3)	Jean-Michaël Celerier
920a7fe2	2023-03-11 00:55:22	Update README.md	Georgi Gerganov
3a57ee59	2023-03-11 00:51:46	Update README.md	Georgi Gerganov
b8502852	2023-03-11 00:09:19	Update README.md	Georgi Gerganov
8a01f565	2023-03-10 23:53:11	Update README.md	Georgi Gerganov
70bc0b8b	2023-03-10 23:46:39	Fix a bug in the rope calculation	Georgi Gerganov
18ebda34	2023-03-10 21:52:27	Update README.md	Georgi Gerganov
319cdb3e	2023-03-10 21:50:46	Final touches	Georgi Gerganov
77532806	2023-03-10 21:47:46	Create README.md	Georgi Gerganov
26c08466	2023-03-10 20:40:58	Initial release	Georgi Gerganov

ae44e23e

2023-03-22 07:47:15

When seed <= 0 - use the clock to generate one

Georgi Gerganov

928480ef

2023-03-22 07:45:00

Init llama_context_params properly from CLI (#370)

Georgi Gerganov

56817b1f

2023-03-22 07:34:02

Remove temporary notice and update hot topics

Georgi Gerganov

f5a77a62

2023-03-22 07:32:36

Introduce C-style API (#370)

Georgi Gerganov

da0e9fe9

2023-03-20 20:14:06

Add SHA256SUMS file and instructions to README how to obtain and verify the downloads

Gary Mulder

e6c9e098

2023-03-21 23:49:24

Fix bin dir for win ci

anzz1

01a297b0

2023-03-21 22:34:25

specify build type for ctest on windows (#371)

Erik Scholz

3366853e

2023-03-21 22:57:35

Add notice about pending change

Georgi Gerganov

3f9c6135

2023-03-21 16:52:27

fix typo in chatLLaMa (#368)

Mathieu Nayrolles

0f613527

2023-03-21 19:47:27

Update issue templates

Georgi Gerganov

353ec251

2023-03-21 14:21:50

We could use std::unordered_map over std::map (#305)

Fabio R. Sluzala

89d5d90f

2023-03-21 18:11:01

Fix color codes emitting mid-UTF8 code. (#312)

Matvey Soloviev

16ffc013

2023-03-21 09:42:25

Importer for GPTQ quantized LLaMA models (#301)

comex

486ae645

2023-03-21 09:27:42

Compute perplexity over prompt (#270)

Gary Linscott

3ab3e658

2023-03-21 18:23:15

Add chatLLaMa script (#198)

Jean-Christophe Hoelt

f157088c

2023-03-21 11:21:06

makefile: Fix CPU feature detection on Haiku (#218)

Alex von Gluck IV

c86ba036

2023-03-21 18:14:46

Enable ANSI colors on Windows 10+ (#311)

anzz1

1daf4dd7

2023-03-21 18:10:32

Minor style changes

Georgi Gerganov

dc6a845b

2023-03-21 18:09:37

Add chat.sh script

Georgi Gerganov

6a612959

2023-03-21 17:05:06

Check for reverse prompt by characters instead of tokens (#292) (#330)

tjohnman

d5f56a5e

2023-03-21 17:04:43

Check for reverse prompt by characters instead of tokens (#292) (#330)

tjohnman

3bfa3b43

2023-03-21 17:59:16

Fix convert script, warnings alpaca instructions, default params

Georgi Gerganov

715d292e

2023-03-21 09:50:09

Add OpenBSD support (#314)

Kevin Lo

c98ae026

2023-03-21 08:49:43

fix typo in comment (#318)

Mack Straight

c3b2306b

2023-03-21 23:44:11

Makefile: slightly cleanup for Mac Intel; echo instead of run ./main -h (#335)

Qingyou Meng

975d2ceb

2023-03-21 17:42:43

cmdline option for custom amount of model parts (--n_parts N) (#348)

anzz1

e0ffc861

2023-03-21 08:34:49

Update IPFS links to quantized alpaca with new tokenizer format (#352)

Kevin Kwok

8f644a0a

2023-03-21 17:32:14

Change default repeat_penalty to 1.0

Georgi Gerganov

eb34620a

2023-03-21 17:29:41

Add tokenizer test + revert to C++11 (#355)

Georgi Gerganov

2e664f1f

2023-03-21 07:35:42

Add initial AVX512 support for dot product on Linux (#320)

Casey Primozic

8cf9f34e

2023-03-21 09:37:16

Adding missing features of CMakeLists.txt & Refactoring (#131)

nusu-github

bd4b46d6

2023-03-20 16:44:30

Nix flake: set meta.mainProgram to llama

Ben Siraphob

6b6d5b50

2023-03-21 03:33:10

Fixed tokenizer.model not found error when model dir is symlink (#325)

Qingyou Meng

a791a68b

2023-03-20 12:26:01

move file magic/version to header, print expected version (#319)

Mack Straight

0f1b21cb

2023-03-20 18:05:20

Docker - Fix publish docker image in GitHub Registry (#235)

Bernat Vadell

074bea2e

2023-03-20 03:17:23

sentencepiece bpe compatible tokenizer (#252)

Mack Straight

5cb63e24

2023-03-20 08:24:11

Add tqdm to Python requirements (#293)

Stephan Walter

da5303c1

2023-03-19 17:44:20

bugfix: default should not be interactive (#304)

cocktailpeanut

4545539d

2023-03-19 21:58:51

Rename script

Georgi Gerganov

edeba283

2023-03-19 21:57:28

Add temporary helper script for Alpaca chat

Georgi Gerganov

5c19c70b

2023-03-19 13:44:30

fix coloring of last `n_batch` of prompt, and refactor line input (#221)

Rickey Bowers Jr

24568371

2023-03-19 20:33:06

Support for multiple reverse prompts. (#299)

tjohnman

7392f1cd

2023-03-19 12:38:44

Improved quantize script (#222)

Suaj Carrot

ad5fd5b6

2023-03-19 19:36:19

Make prompt randomization optional. (#300)

tjohnman

368d0c8a

2023-03-19 19:31:17

Respect the maximum number of tokens in interactive. (#298)

tjohnman

50fae10d

2023-03-19 19:22:48

Add --ignore-eos parameter (#181)

slaren

084e2f0e

2023-03-20 02:10:00

interactive mode: print '\n' in sigint_handler, this flush stdout thus ensure color reset. (#283)

Qingyou Meng

0b366e73

2023-03-19 18:57:00

Command line switch to use F16 for memory_k and memory_v (refactor of #154) (#294)

Erik Scholz

160bfb21

2023-03-19 19:51:55

Update hot topics to mention Alpaca support

Georgi Gerganov

c494ed5b

2023-03-19 19:46:32

Fix off-by-one bug (#115)

Georgi Gerganov

c1c7026b

2023-03-19 19:33:18

Fix python stuff (#109)

Georgi Gerganov

467b1497

2023-03-19 20:17:39

Refactoring `convert-pth-to-ggml.py`: more concise and readable (#109)

qunash

70f01cb8

2023-03-19 19:04:44

Drop trailing new line from file prompts (#80)

Georgi Gerganov

a4e63b73

2023-03-19 18:49:50

Add instruction for using Alpaca (#240)

Georgi Gerganov

9e170721

2023-03-19 18:37:02

Add "--instruct" argument for usage with Alpaca (#240)

Georgi Gerganov

22213a17

2023-03-19 17:30:00

Change RMSNorm eps to 1e-6 (#173)

Georgi Gerganov

d7def1a7

2023-03-18 17:10:47

Warn user if a context size greater than 2048 tokens is specified (#274)

Ronsor

6f61c18e

2023-03-18 22:39:46

Fix typo in readme

Pavol Rusnak

1e5a6d08

2023-03-18 22:20:04

Add note about Python 3.11 to readme

Pavol Rusnak

554b5415

2023-03-18 21:58:46

Add memory/disk requirements to readme

Pavol Rusnak

d3f202d5

2023-03-18 20:51:49

Remove unused code since n_vocab is model.hparams.n_vocab (#262)

Alex Nguyen

e03e3597

2023-03-18 07:44:09

fixed warning with std::ignore about unused function result (#151)

Justin Suess

a81d0c2a

2023-03-18 04:17:19

Fix n^2 loop in tokenization (#254)

Gary Linscott

b2de7f18

2023-03-18 09:27:12

CI Improvements (#230)

anzz1

a2927478

2023-03-17 23:03:48

Nix flake (#40)

Niklas Korz

c9f670a1

2023-03-17 21:05:58

Implement non-greedy tokenizer that tries to maximize token lengths (#242)

thement

4f546091

2023-03-17 21:46:46

Default to 4 threads (#243)

Georgi Gerganov

e81b9c81

2023-03-17 20:30:04

Update Contributing section

Georgi Gerganov

367946c6

2023-03-17 17:47:35

Don't tell users to use a bad number of threads (#243)

Stephan Walter

6b0df5cc

2023-03-18 00:38:24

add ptread link to fix cmake build under linux (#114)

mmyjona

2af23d30

2023-03-17 10:47:06

🚀 Dockerize llamacpp (#132)

Bernat Vadell

904d2a8d

2023-03-17 05:48:39

Q4_1 quantization (#193)

Matvey Soloviev

72131107

2023-03-16 15:00:09

Update README.md

Georgi Gerganov

ac15de78

2023-03-16 08:55:13

Expand "Contributing" section

Georgi Gerganov

273abc47

2023-03-16 07:12:12

Update hot topics - RMSnorm

Georgi Gerganov

9b4a15b1

2023-03-15 19:29:25

Fix RMS norm in GGML (#191)

Nebula

6eac39ba

2023-03-15 18:41:38

Add RMS norm and use it (#187)

hoangmit

27944c42

2023-03-15 21:35:25

fixed typo (#178)

moritzbrantner

2d15d6c9

2023-03-15 13:56:24

add SIGINT support for _WIN32 environments (#120)

Rickey Bowers Jr

2d64715a

2023-03-15 15:42:40

added ctx_size parameter (#148)

Justin Suess

16b2c61a

2023-03-15 15:39:38

fixed color reset on exit (#149)

Justin Suess

977295c7

2023-03-15 22:39:06

Fix potential licensing issue (#126)

Musab Gultekin

956dfda8

2023-03-15 12:37:50

Use `tokenizer.vocab_size()` instead of hardcoding 32000 in convert-pth-to-ggml.py (#142)

Ronsor

113e685d

2023-03-15 15:05:14

inline -> static inline for "bytesFromNibbles" (#161)

hoangmit

47857e56

2023-03-14 12:34:37

Don't use vdotq_s32 if it's not available (#139)

Ronsor

60f819a2

2023-03-14 15:30:08

Add section to README on how to run the project on Android (#130)

Radoslav Gerganov

97ab2b25

2023-03-14 09:43:52

Add Misc section + update hot topics + minor fixes

Georgi Gerganov

2f700a27

2023-03-13 17:29:10

Add windows to the CI (#98)

Sebastián A

c09a9cfb

2023-03-13 21:22:15

CMake build in Release by default (#75)

Georgi Gerganov

7ec903d3

2023-03-13 19:21:51

Update contribution section, hot topics, limitations, etc.

Georgi Gerganov

4497ad81

2023-03-13 19:15:08

Print system information

Georgi Gerganov

ed6849cc

2023-03-13 14:12:33

Initial support for CMake (#75)

Sebastián A

41be0a3b

2023-03-13 17:40:54

Add NetBSD support. (#90)

Thomas Klausner

671d5cac

2023-03-13 17:39:56

Use fprintf for diagnostic output (#48)

Pavol Rusnak

84d9015c

2023-03-13 18:36:44

Use vdotq_s32 to improve performance (#67)

Georgi Gerganov

63fd76fb

2023-03-14 01:33:43

Reduce model loading time (#43)

uint256_t

2a20f48e

2023-03-13 12:24:18

Fix UTF-8 handling (including colors) (#79)

Val Kharitonov

d1f22471

2023-03-13 17:15:20

Add quantize script for batch quantization (#92)

Pavol Rusnak

1808ee05

2023-03-13 09:42:26

Add initial contribution guidelines

Georgi Gerganov

a169bb88

2023-03-13 04:08:01

Gate signal support on being on a unixoid system. (#74)

Matvey Soloviev

460c4825

2023-03-13 00:35:51

Fix token count accounting

Matvey Soloviev

c80e2a8f

2023-03-13 01:28:08

Revert "10% performance boost on ARM"

Georgi Gerganov

54a0e66e

2023-03-13 01:21:03

Check for vdotq_s32 availability

Georgi Gerganov

543c57e9

2023-03-13 01:05:24

Ammend to previous commit - forgot to update non-QRDMX branch

Georgi Gerganov

113a9e83

2023-03-13 00:56:10

10% performance boost on ARM

Georgi Gerganov

404fac0d

2023-03-12 23:07:34

Fix color getting reset before prompt output done (#65)

Matvey Soloviev

1a0a7430

2023-03-12 23:39:01

Update README.md

Georgi Gerganov

96ea727f

2023-03-12 22:13:28

Add interactive mode (#61)

Matvey Soloviev

96619548

2023-03-13 03:30:08

Fix typo in README (#45)

Marc Köhlbrugge

f385f8de

2023-03-12 13:28:36

Allow using prompt files (#59)

Ben Garney

02f0c6fe

2023-03-12 16:23:15

Add back top_k (#56)

beiller

eb062bb0

2023-03-12 17:15:00

Windows fixes (#31)

Sebastián A

7027a978

2023-03-12 22:09:26

Update README.md

Georgi Gerganov

2d555e5b

2023-03-12 22:08:24

Add CI (#60)

Georgi Gerganov

7c9e54e5

2023-03-12 20:59:01

Revert "weights_only" arg - this causing more trouble than help

Georgi Gerganov

b9bd1d01

2023-03-12 14:16:33

python/pytorch compat notes (#44)

Oleksandr Nikitin

129c7d1e

2023-03-12 05:27:42

Add repetition penalty (#20)

beiller

702fddf5

2023-03-12 09:03:25

Clarify meaning of hacking

Georgi Gerganov

7d86e25b

2023-03-12 08:41:54

README: add "Supported platforms" + update hot topics

Georgi Gerganov

a9312023

2023-03-11 22:36:35

use weights_only in conversion script (#32)

deepdiffuser

6a9a67f0

2023-03-12 07:36:03

Add LICENSE (#21)

Pavol Rusnak

da1a4ff0

2023-03-12 01:26:32

Update README.md

Georgi Gerganov

6b2cb630

2023-03-11 18:32:20

Fix a typo in model name (#16)

Juraj Bednar

4235e3d5

2023-03-11 18:10:18

Update README.md

Georgi Gerganov

f1eaff47

2023-03-11 17:58:18

Add AVX2 support for x86 architectures thanks to @Const-me !

Georgi Gerganov

a9e58529

2023-03-11 17:40:14

Fix un-initialized FP16 tables on x86 (#15, #2)

Georgi Gerganov

7d9ed7b2

2023-03-11 12:44:21

Bump memory buffer

Georgi Gerganov

0c680332

2023-03-11 12:31:21

Update README.md

Georgi Gerganov

f60fa9e5

2023-03-11 12:26:46

.gitignore models/

Georgi Gerganov

7211862c

2023-03-11 12:26:16

Update Makefile var + add comment

Georgi Gerganov

a5c5ae2f

2023-03-11 11:34:25

Update README.md

Georgi Gerganov

ea977e85

2023-03-11 11:34:11

Update README.md

Georgi Gerganov

007a8f6f

2023-03-11 10:47:09

Support all LLaMA models + change Q4_0 quantization storage

Georgi Gerganov

5f2f970d

2023-03-10 21:47:26

Include Python dependencies in README (#6)

Simon Willison

73c6ed5e

2023-03-11 01:30:47

Update README.md

Georgi Gerganov

01eeed8f

2023-03-11 01:22:58

Update README.md

Georgi Gerganov

6da2df34

2023-03-11 01:18:10

Update README.md

Georgi Gerganov

9dcf4dba

2023-03-10 18:04:06

Add missing headers for memcpy and assert (#3)

Jean-Michaël Celerier

920a7fe2

2023-03-11 00:55:22

Update README.md

Georgi Gerganov

3a57ee59

2023-03-11 00:51:46

Update README.md

Georgi Gerganov

b8502852

2023-03-11 00:09:19

Update README.md

Georgi Gerganov

8a01f565

2023-03-10 23:53:11

Update README.md

Georgi Gerganov

70bc0b8b

2023-03-10 23:46:39

Fix a bug in the rope calculation

Georgi Gerganov

18ebda34

2023-03-10 21:52:27

Update README.md

Georgi Gerganov

319cdb3e

2023-03-10 21:50:46

Final touches

Georgi Gerganov

77532806

2023-03-10 21:47:46

Create README.md

Georgi Gerganov

26c08466

2023-03-10 20:40:58

Initial release

Georgi Gerganov

Liu Song’s Projects

~/Projects/llama.cpp

History