[korean_project(1)] Parsing JSON data and saving it to a txt file

mihee 2023. 1. 3. 16:31

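I downloaded the Korean typeface image dataset (한국어 글자체 이미지) from AI Hub and want to understand its data format.

However, the file is too large to open comfortably in an editor, so I parse the JSON file with Python instead.

1. Read the JSON file and check its keys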

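import json
import numpy as np

# Open as UTF-8 explicitly so the Korean text decodes the same way on every platform
with open('./handwriting_data_info_clean.json', encoding='utf-8') as f:
    datas = json.load(f)

print(datas.keys()) # dict_keys(['info', 'images', 'annotations', 'licenses'])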

Of these, the data I need is annotations, so I will pull out just that part and print it.

2. Fetch annotations and store only the needed parts in a new list

annotations data: of its fields, I will use only text and image_id.
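When a file is too big to read by eye, a quick summary of each top-level key helps before deciding what to keep. The following is a minimal sketch, not part of the original post: `summarize` is a hypothetical helper, and `sample` is a tiny made-up stand-in for the real file (its field values are invented for illustration).

```python
# Hypothetical helper: summarize each top-level key of a loaded JSON dict.
def summarize(data, preview=1):
    """Return {key: (type name, length or None, first `preview` items)}."""
    summary = {}
    for key, value in data.items():
        if isinstance(value, list):
            # For lists, show how many entries there are and the first few.
            summary[key] = (type(value).__name__, len(value), value[:preview])
        elif isinstance(value, dict):
            # For dicts, show how many keys there are and the first few key names.
            summary[key] = (type(value).__name__, len(value), list(value)[:preview])
        else:
            # Scalars have no length; show the value itself.
            summary[key] = (type(value).__name__, None, value)
    return summary


# Tiny stand-in for the real file; the field values here are made up.
sample = {
    "info": {"name": "sample"},
    "images": [{"id": "00000001"}],
    "annotations": [
        {"image_id": "00000001", "text": "가", "attributes": {"type": "글자(음절)"}},
    ],
    "licenses": [],
}

for key, (kind, size, head) in summarize(sample).items():
    print(key, kind, size, head)
```

With the real data, calling `summarize(datas)` would show at a glance that annotations is a list and what a single entry looks like.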

annotations = datas['annotations']
# print(type(annotations)) # list
arr = []
for annotation in annotations:
    attribute = annotation['attributes']
    # Keep only the single-syllable character samples
    if attribute['type'] == '글자(음절)':
        # Store [image file name, label text]
        arr.append([annotation['image_id']+'.png', annotation['text']])
new_arr = np.array(arr)  # 2-D string array, one row per sample
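The filtering step can also be written as a small function that is easy to try on a few records first. This is a sketch under assumptions: `extract_pairs` is a hypothetical name, the sample records are made up, and the second type string (`단어(어절)`) is only an illustrative non-matching value. It uses `.get` so that a record without `attributes` is skipped instead of raising `KeyError`, and an f-string so that `image_id` may be a string or a number.

```python
def extract_pairs(annotations, wanted_type="글자(음절)"):
    """Return [file name, text] pairs for annotations of the wanted type."""
    pairs = []
    for annotation in annotations:
        # A missing 'attributes' dict is treated as "no type", so it is skipped.
        attributes = annotation.get("attributes", {})
        if attributes.get("type") == wanted_type:
            pairs.append([f"{annotation['image_id']}.png", annotation["text"]])
    return pairs


# Made-up records for illustration only.
sample_annotations = [
    {"image_id": "00000001", "text": "가", "attributes": {"type": "글자(음절)"}},
    {"image_id": "00000002", "text": "나무", "attributes": {"type": "단어(어절)"}},
    {"image_id": "00000003", "text": "다"},  # no attributes: skipped
]

pairs = extract_pairs(sample_annotations)
print(pairs)  # [['00000001.png', '가']]
```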

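3. Save the array that holds only the extracted data as a txt file

# Write one "file_name text" pair per line; UTF-8 keeps the Korean labels intact
np.savetxt('./annotation.txt', new_arr, fmt='%s', delimiter=' ', encoding='utf-8')
print("-------------save done.---------")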
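To check the result, the saved file can be read back into (file name, text) pairs. The sketch below assumes the file is UTF-8 and that each line has the form `file_name text`; `load_pairs` is a hypothetical helper, and the example writes its own small temporary file rather than touching the real annotation.txt.

```python
import os
import tempfile


def load_pairs(path):
    """Read 'file_name text' lines back into a list of (file_name, text) tuples."""
    pairs = []
    with open(path, encoding="utf-8") as f:
        for line in f:
            line = line.rstrip("\n")
            if not line:
                continue  # ignore blank lines
            # Split on the first space only, so the text part stays whole.
            filename, text = line.split(" ", 1)
            pairs.append((filename, text))
    return pairs


# Round trip on a temporary file with made-up contents.
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "annotation.txt")
    with open(path, "w", encoding="utf-8") as f:
        f.write("00000001.png 가\n00000002.png 나\n")
    loaded = load_pairs(path)

print(loaded)  # [('00000001.png', '가'), ('00000002.png', '나')]
```

Splitting on the first space only is a deliberate choice: a space-delimited format is fine for single syllables, but would be ambiguous if the label text itself contained spaces.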
