Posts in the 'Deep Learning' category (Page 7)

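[Python] How to run onnxruntime inference directly on data uploaded to the GPU (device) without downloading it to the CPU (host)

Most onnxruntime inference examples pass session.run a numpy array that was pre-processed on the CPU and lives in CPU memory. In practice, though, pre-processing usually runs on the GPU too, and there is rarely a reason to bring the result back down to the CPU just to feed it in. Uploads and downloads between GPU and CPU are practically the axis of evil and should be cut wherever possible. With large inputs in particular, bouncing data up to the GPU and back down to the CPU can end up slower than simply implementing everything on the CPU. So we need to be able to run inference directly on data that is already on the GPU! onnxruntime does, of course, provide this capability..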
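The excerpt above is cut off before it names the mechanism, so the following is only a minimal sketch of one way to do this, assuming onnxruntime's IOBinding API and a CUDA execution provider. The helper name `run_with_device_input` and the tensor names in the usage comment are hypothetical, and the original post may use a different approach.

```python
def run_with_device_input(session, input_name, output_name,
                          buffer_ptr, shape, element_type, device_id=0):
    """Run `session` on an input buffer that already lives in CUDA memory.

    `session` is expected to be an onnxruntime.InferenceSession created with
    the CUDA execution provider. `buffer_ptr` is the raw device address of a
    contiguous tensor (for example `tensor.data_ptr()` in PyTorch).
    """
    binding = session.io_binding()

    # Describe the existing device buffer instead of passing a numpy array,
    # so no host copy is needed for the input.
    binding.bind_input(
        name=input_name,
        device_type="cuda",
        device_id=device_id,
        element_type=element_type,
        shape=tuple(shape),
        buffer_ptr=buffer_ptr,
    )

    # Ask onnxruntime to allocate the output on the same GPU.
    binding.bind_output(output_name, "cuda", device_id)

    session.run_with_iobinding(binding)

    # A list of OrtValue objects; they stay on the device until copied.
    return binding.get_outputs()


# Hypothetical usage (model path and tensor names are assumptions):
#   import numpy as np, onnxruntime as ort, torch
#   session = ort.InferenceSession("model.onnx",
#                                  providers=["CUDAExecutionProvider"])
#   x = torch.rand(1, 3, 640, 640, device="cuda").contiguous()
#   outputs = run_with_device_input(session, "images", "output0",
#                                   x.data_ptr(), x.shape, np.float32)
```

The input tensor has to stay alive and contiguous for the duration of the call, because onnxruntime only receives its address.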

Deep Learning 2024. 4. 11. 23:56

A paper comparing ConvNets and ViTs

https://arxiv.org/abs/2311.09215 ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy Modern computer vision offers a great variety of models to practitioners, and selecting a model from multiple options for specific applications can be challenging. Conventionally, competing model architectures and training protocols are compared by their c arxiv.org I only skimmed the conclusions. Let's read it again later (will I, though?)

Deep Learning 2024. 2. 29. 00:01

2024-02-22: YOLOv9 is released.

https://github.com/WongKinYiu/yolov9 GitHub - WongKinYiu/yolov9: Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Inform Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information - WongKinYiu/yolov9 github.com https://arxiv.org/pdf/2402.13616.pdf I wonder how the plot would look if the x-axis were latency.

Deep Learning 2024. 2. 22. 20:21

META V-JEPA

https://ai.meta.com/blog/v-jepa-yann-lecun-ai-model-video-joint-embedding-predictive-architecture/ V-JEPA: The next step toward advanced machine intelligence Previous work had to do full fine-tuning, which means that after pre-training your model, when you want the model to get really good at fine-grained action recognition while you’re adapting your model to take on that task, you have to updat..

Deep Learning 2024. 2. 18. 22:39

Google Magika: file type identification with AI

https://opensource.googleblog.com/2024/02/magika-ai-powered-fast-and-efficient-file-type-identification.html Magika: AI powered fast and efficient file type identification Magika code and model are freely available starting today in Github under the Apache2 License. opensource.googleblog.com https://github.com/google/magika GitHub - google/magika: Detect file content types with deep learning Det..

Deep Learning 2024. 2. 18. 21:48

Input image size of COCO pre-trained YOLOv5 and YOLOv8, and the image resolution of the COCO dataset

COCO pre-trained YOLOv5 and YOLOv8 are generally known to take 640x640 input images. YOLOv5 does also have high-mAP-oriented models that take 1280x1280 input. Why 640x640? First, the model has an FPN structure and downsamples the input image by 32x, so the input resolution is constrained to be a multiple of 32. And 640 is indeed a multiple of 32. But there are countless multiples of 32, so why 640? Let's extract some statistics on the image resolutions in the COCO dataset, using the 2017 release. Train #images: 118287 min h: 51, max h: 640, mean h: 484..
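Statistics like the ones quoted above could be gathered roughly as follows. This is a sketch, not the post's own script: it assumes the standard COCO annotation JSON, whose `images` list stores a `width` and `height` for each image, and the helper names and the annotation path in the usage comment are hypothetical.

```python
import json
from statistics import mean


def load_coco_images(annotation_path):
    """Return the `images` list from a COCO-format annotation file."""
    with open(annotation_path, encoding="utf-8") as f:
        return json.load(f)["images"]


def resolution_stats(images):
    """Compute min/max/mean height and width over COCO image records."""
    heights = [img["height"] for img in images]
    widths = [img["width"] for img in images]
    return {
        "num_images": len(images),
        "min_h": min(heights), "max_h": max(heights), "mean_h": mean(heights),
        "min_w": min(widths), "max_w": max(widths), "mean_w": mean(widths),
    }


def is_multiple_of_stride(size, stride=32):
    """Check the 32x downsampling constraint on an input side length."""
    return size % stride == 0


# Hypothetical usage (the path is an assumption):
#   images = load_coco_images("annotations/instances_train2017.json")
#   print(resolution_stats(images))
#   print(is_multiple_of_stride(640))
```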

Deep Learning 2024. 2. 12. 16:01

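So motherboards with PCIe 5.0 support are out

CPU2GPU and GPU2CPU transfers are a real headache. The latency of uploading and downloading memory between processors is determined by the PCIe lanes, so there is no real way to reduce it, which is quite frustrating. I spec out PC builds on Danawa from time to time. Looking at motherboards lately, I see they support PCIe 5.0. I hope more products supporting PCIe 5.0 arrive soon so that the delay from device-to-device data transfers drops sharply.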
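To put a rough number on the hoped-for gain, here is a back-of-the-envelope sketch. It assumes a x16 link and uses the theoretical per-lane signalling rates (8, 16 and 32 GT/s for PCIe 3.0, 4.0 and 5.0, with 128b/130b encoding). It only models bandwidth: protocol overhead and the fixed latency of each transfer are ignored, so real transfers will be slower. The function names are hypothetical.

```python
# Per-lane signalling rate in GT/s for each PCIe generation.
GT_PER_S = {3: 8.0, 4: 16.0, 5: 32.0}


def peak_bandwidth_gb_s(gen, lanes=16):
    """Theoretical one-direction bandwidth in GB/s (128b/130b encoding)."""
    payload_fraction = 128 / 130
    return GT_PER_S[gen] * payload_fraction / 8 * lanes


def transfer_time_ms(num_bytes, gen, lanes=16):
    """Idealized time in milliseconds to move `num_bytes` over the link."""
    return num_bytes / (peak_bandwidth_gb_s(gen, lanes) * 1e9) * 1e3


# Example payload: one 1x3x640x640 float32 tensor.
payload = 1 * 3 * 640 * 640 * 4
for gen in (4, 5):
    print(f"PCIe {gen}.0 x16: {peak_bandwidth_gb_s(gen):.1f} GB/s, "
          f"{transfer_time_ms(payload, gen):.3f} ms")
```

Under these assumptions each generation halves the bandwidth-bound part of the transfer time, while the fixed per-transfer overhead is not captured by the model.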

Deep Learning 2024. 2. 11. 23:28

YOLOv8 low-level output visualization code

https://github.com/developer0hye/Explainable-YOLOv8 GitHub - developer0hye/Explainable-YOLOv8: Visualize the low-level outputs of YOLOv8 to analyze and understand the areas where o Visualize the low-level outputs of YOLOv8 to analyze and understand the areas where our model focuses. Specifically, illustrate which anchor points are activated to predict bounding boxes. - GitHub... github.com Which anchors..

Deep Learning 2024. 2. 2. 00:57

What Apple tried in order to run Vision Transformers efficiently on its own NPU

https://machinelearning.apple.com/research/neural-engine-transformers Deploying Transformers on the Apple Neural Engine An increasing number of the machine learning (ML) models we build at Apple each year are either partly or fully adopting the [Transformer… machinelearning.apple.com https://machinelearning.apple.com/research/vision-transformers Deploying Attention-Based Vision Transformers to A..

Deep Learning 2024. 1. 22. 17:22

[Deep Learning] A detection model that caught my interest: Plain-DETR, DETR Does Not Need Multi-Scale or Locality Design

https://github.com/impiga/Plain-DETR GitHub - impiga/Plain-DETR: [ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design [ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design - GitHub - impiga/Plain-DETR: [ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design github.com https://openaccess.thecvf.com/content/ICCV2023/html/Lin_DETR_Does_Not_Need_Multi-Scale_or_Locality_Design_ICCV_2..

Deep Learning 2023. 12. 6. 18:25
