Commit Graph

66 Commits

SHA1 Message Date
3969ce30ed Speed up Tensor Cores: 1st layer uses FP32; pre-allocate GPU memory for Tensor Cores 2018-12-11 23:48:58 +03:00
25f133d6ef Another minor fix 2018-12-11 21:26:36 +03:00
cb998db949 Some fix for CUDNN_HALF 2018-12-11 21:16:18 +03:00
a621235783 Switch to Tensor Cores after 2000 iterations. 2018-12-10 01:35:08 +03:00
dc7f8a32ae mAP calculation during training if the -map flag is used 2018-12-09 18:18:47 +03:00
742bb7c7ce Compile fix 2018-12-07 22:52:07 +03:00
7c2f302321 Fixed nan issue for training with CUDNN_HALF=1 by using Tensor Cores 2018-12-07 22:40:10 +03:00
21a4ec9390 Automatically save the loss chart every 100 iterations 2018-11-26 11:11:56 +03:00
9f7d7c58b5 Minor fixes. Use CUDA 10.0 2018-11-17 02:48:46 +03:00
25f65f6878 Added fast_binarize_weights_gpu() 2018-11-05 22:38:35 +03:00
c0e2512af2 Activation improvement, more robust timer. 2018-09-27 23:10:54 +03:00
7dd97537fb XNOR-net tiny-yolo_xnor.cfg ~2x faster than cuDNN on CUDA (nVidia GPU Maxwell) 2018-09-22 02:01:14 +03:00
03e95320a1 XNOR coalesced memory access, and avoid bank conflicts 2018-09-17 23:39:25 +03:00
ca43bbdaae Fixed openmp bugs for XNOR 2018-09-12 16:22:54 +03:00
c0e01fd63c Test for XNOR-conv on CUDA 2018-09-08 02:46:05 +03:00
b141f85cab Compile fix 2018-09-07 15:07:46 +03:00
007878393f Temporary Slow implementation of XNOR on CUDA (shared_memory) 2018-09-06 23:21:26 +03:00
c4a9e3422e Temporary implementation of XNOR on CUDA 2018-08-31 02:47:58 +03:00
9753b72aeb temp fix, don't use it 2018-08-30 17:24:41 +03:00
cfc5fedbb6 Just used spaces for indents instead of Tabs 2018-07-10 23:29:15 +03:00
9bae70b225 Accelerated by another 5% using FP16/32 Batch-norm for Tensor Cores. 2018-04-17 02:51:11 +03:00
537d135feb Improve training performance - batch-norm using cuDNN. 2018-03-20 02:16:51 +03:00
880cf187d8 Fixed multi-GPU training for Tensor Cores 2018-03-09 19:44:46 +03:00
cad4d1618f Added support for Tensor Cores (CC >= 7.0, e.g. V100). For FP16/32 mixed precision, build with CUDNN_HALF defined. 2018-02-25 16:29:44 +03:00
cd2bdec090 Updated to CUDA 9.1. And fixed no_gpu dependencies. 2018-02-23 15:05:31 +03:00
6332ea99ab one more fix 2018-02-23 00:13:08 +03:00
b2b5756d86 Added __float2half_rn() and __half2float() 2018-02-22 23:52:43 +03:00
dda993f3dd Use half_float16 instead of float32 if both CUDNN and CUDNN_HALF are defined. Use Tensor Cores. 2018-02-22 22:54:40 +03:00
9920410ba9 minor fix 2017-07-14 12:11:45 +03:00
d7a30ada7e Fixed behavior if missing library cudnn.lib 2017-01-16 12:51:42 +03:00
3b9afd4cd2 Fixed behavior if missing library cudnn.lib 2017-01-16 00:44:41 +03:00
75fe603722 :vegan: :charizard: 2016-11-24 22:56:23 -08:00
c7a700dc22 new font strategy 2016-11-05 14:09:21 -07:00
352ae7e65b ADAM 2016-10-26 08:35:44 -07:00
73f7aacf35 better multigpu 2016-09-20 11:34:49 -07:00
5c067dc447 good chance I didn't break anything 2016-09-12 13:55:20 -07:00
8f1b4e0962 updates and things 2016-09-01 16:48:41 -07:00
afb8b4f98b CVPR prep 2016-06-22 21:46:32 -07:00
08c7cf9c88 no mean on input binarization 2016-06-19 14:28:15 -07:00
8322a58cf6 hate warnings 2016-06-14 11:30:28 -07:00
729ce43e6e stuff 2016-06-09 17:20:31 -07:00
ec3d050a76 hope i didn't break anything 2016-06-02 15:25:24 -07:00
13209df7bb art, cudnn 2016-05-13 11:59:43 -07:00
c7b10ceadb so much need to commit 2016-05-06 16:25:16 -07:00
cff59ba135 go updates 2016-03-16 04:30:48 -07:00
d1965bdb96 Go 2016-03-13 23:18:42 -07:00
16d06ec0db stuff 2016-02-29 13:54:12 -08:00
913d355ec1 lots of stuff 2016-01-28 12:30:38 -08:00
892923514f fixed darknet, stuff 2015-12-08 15:12:10 -08:00
c2738835f0 Faster batch normalization 2015-12-07 17:18:04 -08:00