
GPT went down around 4 PM yesterday.. the same thing happened about a week ago, and when it broke again I cursed Sam Altman. But today I suddenly needed some code explained, so I asked it, and it's not the GPT-4 I knew. Now I'm embarrassed I was about to take my $20 over to Claude.

First off, generation speed is genuinely much faster -> interpreting and converting roughly 1,000 lines of code (.ts -> .py) took only about 10-15 seconds.

Throughput looks like at least 50 tokens per second.

Also, the perennially criticized 'delve' (it always showed up whenever you asked it to 'explain the basics of ...') doesn't appear at all anymore.

The GPT models also used to drag out answers with meaningless filler like "Sure! let's dive (delve) into ...", so when you actually asked them to write code they'd generate less than half of it or drop features entirely. That filler is gone now and it just gets straight to the point, which is a relief.

I haven't used it much yet, but it behaved the same way until around 2 AM and then suddenly changed, so something really did happen..

Thanks for the stealth upgrade, Altman. Sorry for cursing you out.

Also, if there's anything you're curious about or want me to try, leave a comment.. I'm not great at testing.

Test 1: Converting Risu's hanuraiMemory TypeScript to Python

The prompt was just the code dumped in plus one line: convert this typescript code to python

Result:

The TypeScript code provided is a function that manages memory for a chat system using a vector search engine and token count management. It processes chat history, calculates similarity scores for recent messages, and adjusts the context to fit within a maximum token limit. Here's a breakdown of the code:

  1. Imports: The code imports several modules and types, including an error alert function, a memory processor (HypaProcesser), a tokenizer, and a database.

  2. Function Signature: The hanuraiMemory function takes an array of chats and an argument object with the current token count, maximum context token count, and a tokenizer.

  3. Variable Initialization: The function initializes several variables, including a HypaProcesser instance, an array for additional texts, and a query index for recent chat queries.

  4. Chat Processing: The function loops through recent chats, processes them, and adds them to the memory processor.

  5. Similarity Search: The function calculates similarity scores for recent chat messages and aggregates them.

  6. Token Management: The function adjusts the token count by removing older chat messages until the current token count fits within the maximum context token count.

  7. Result Assembly: The function assembles result texts based on similarity scores and adds a system memory message to the chat history.

  8. Return: The function returns the updated token count and chat history. (code below omitted)


It explained the code on its own, and overall the converted Python seems to match the original.
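
If you want a feel for the structure it described, here's a rough, self-contained sketch of that eight-point flow in Python. This is not GPT's actual output (that part was omitted above): MemoryStub is just a stand-in for Risu's real HypaProcesser, whose actual API I'm not reproducing, and the names here (hanurai_memory_sketch, the chat dict shape, the whitespace tokenizer) are made up purely for illustration.

```python
# Hypothetical sketch only -- not the model's output and not Risu's real API.
from typing import Callable, Dict, List, Tuple


class MemoryStub:
    """Stand-in for the real HypaProcesser: stores texts and fakes similarity scores."""

    def __init__(self) -> None:
        self.texts: List[str] = []

    def add_texts(self, texts: List[str]) -> None:
        self.texts.extend(texts)

    def similarity_search(self, query: str) -> List[Tuple[str, float]]:
        # Fake score: word overlap with the query, highest first.
        q = set(query.split())
        scored = [(t, float(len(q & set(t.split())))) for t in self.texts]
        return sorted(scored, key=lambda pair: pair[1], reverse=True)


def hanurai_memory_sketch(
    chats: List[Dict[str, str]],
    current_tokens: int,
    max_context_tokens: int,
    tokenize: Callable[[str], int],
) -> Tuple[int, List[Dict[str, str]]]:
    processor = MemoryStub()

    # 4. Chat processing: feed the chat history into the memory store.
    processor.add_texts([c["data"] for c in chats])

    # 5. Similarity search: score stored texts against the most recent messages.
    query = "\n".join(c["data"] for c in chats[-3:])
    scored = processor.similarity_search(query)

    # 6. Token management: drop the oldest messages until we fit the budget.
    while current_tokens > max_context_tokens and len(chats) > 1:
        removed = chats.pop(0)
        current_tokens -= tokenize(removed["data"])

    # 7. Result assembly: prepend the best-matching memories as a system message.
    memory_text = "\n".join(text for text, _ in scored[:3])
    chats.insert(0, {"role": "system", "data": memory_text})
    current_tokens += tokenize(memory_text)

    # 8. Return the updated token count and chat history.
    return current_tokens, chats


# Example usage with a whitespace "tokenizer" standing in for the real one.
if __name__ == "__main__":
    history = [{"role": "user", "data": f"message number {i}"} for i in range(10)]
    tokens, new_history = hanurai_memory_sketch(
        history, current_tokens=500, max_context_tokens=480,
        tokenize=lambda s: len(s.split()),
    )
    print(tokens, len(new_history))
```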




Test 2: Explaining code

Prompt (in English): explain how to implement an XOR gate as 3 layers (including input/output), broken into the 4 steps it had come up with itself earlier

Result:

Sure, let’s set up a simple neural network in Python to solve the XOR problem. We will implement each of the four steps you mentioned.


### Step 1: Initializing Weights and Biases


We start by setting up the architecture of the neural network and initializing the weights and biases. We'll use NumPy for this purpose since it provides efficient array operations.


```python
import numpy as np

# XOR inputs and outputs
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
y = np.array([[0], [1], [1], [0]])

# Initialize weights and biases
input_size = 2
hidden_size = 2
output_size = 1

np.random.seed(42)  # For reproducibility
W1 = np.random.uniform(size=(input_size, hidden_size))
b1 = np.random.uniform(size=(1, hidden_size))
W2 = np.random.uniform(size=(hidden_size, output_size))
b2 = np.random.uniform(size=(1, output_size))
```


### Step 2: Forward Propagation


Next, we compute the outputs of the network given the inputs. The outputs of each layer are calculated using matrix multiplication followed by applying an activation function.


```python
def sigmoid(x):
    return 1 / (1 + np.exp(-x))


def forward(X, W1, b1, W2, b2):
    # Calculate hidden layer activation
    hidden_input = np.dot(X, W1) + b1
    hidden_output = sigmoid(hidden_input)

    # Calculate output layer activation
    final_input = np.dot(hidden_output, W2) + b2
    final_output = sigmoid(final_input)

    return hidden_output, final_output


hidden_output, final_output = forward(X, W1, b1, W2, b2)
```


### Step 3: Loss Calculation


To measure the error, we can use binary cross-entropy, which is suitable for binary classification problems like XOR.


```python
def binary_cross_entropy(y_true, y_pred):
    return -np.mean(y_true * np.log(y_pred) + (1 - y_true) * np.log(1 - y_pred))


loss = binary_cross_entropy(y, final_output)
```


### Step 4: Backpropagation and Optimization


We now calculate the gradients using backpropagation and adjust the weights and biases accordingly. We use the chain rule to propagate errors backward through the network.


```python
def backward(X, y, hidden_output, final_output, W1, b1, W2, b2, learning_rate):
    # Output layer error
    output_error = final_output - y
    output_delta = output_error * final_output * (1 - final_output)

    # Hidden layer error
    hidden_error = output_delta.dot(W2.T)
    hidden_delta = hidden_error * hidden_output * (1 - hidden_output)

    # Update weights and biases
    W2 -= hidden_output.T.dot(output_delta) * learning_rate
    b2 -= np.sum(output_delta, axis=0, keepdims=True) * learning_rate
    W1 -= X.T.dot(hidden_delta) * learning_rate
    b1 -= np.sum(hidden_delta, axis=0, keepdims=True) * learning_rate

    return W1, b1, W2, b2


# Hyperparameters
learning_rate = 0.1
epochs = 10000

for _ in range(epochs):
    hidden_output, final_output = forward(X, W1, b1, W2, b2)
    W1, b1, W2, b2 = backward(X, y, hidden_output, final_output, W1, b1, W2, b2, learning_rate)

# After training, the final output should approximate the expected results
hidden_output, final_output = forward(X, W1, b1, W2, b2)
print("Final outputs after training:\n", final_output)
```


### Summary


In this example, you’ve seen how to implement a neural network for the XOR problem using the following steps:


1. **Initialize Weights and Biases**: We set up the network architecture and initialized the weights and biases.

2. **Forward Propagation**: We calculated the hidden and final outputs using matrix multiplication and a sigmoid activation function.

3. **Loss Calculation**: We used binary cross-entropy to calculate the error.

4. **Backpropagation and Optimization**: We computed the gradients and adjusted the weights and biases using gradient descent.


With this foundation in place, you can now explore more complex networks and problems. How comfortable do you feel with this implementation, and are there any parts you'd like to delve into further or clarify?

The delve problem isn't completely solved, but at a glance it no longer opens with that annoying delve-heavy start.

Good, good.