I'm trying to run GPT-Neo-1.3B locally on my server and call it via an API. I installed everything, but when I call it through the API, the response is essentially the same text as the question I asked. Below are my code and my request/response. I'm using Python/Flask.
Any idea what is wrong with my code, please?
from transformers import GPTNeoForCausalLM, GPT2Tokenizer
import torch
from flask import Flask, request, jsonify, abort

# Load the model and tokenizer
model = GPTNeoForCausalLM.from_pretrained("./gpt_neo_1.3B")
tokenizer = GPT2Tokenizer.from_pretrained("./gpt_neo_1.3B")

# Define a pad_token if it's not already defined
if tokenizer.pad_token is None:
    tokenizer.add_special_tokens({'pad_token': '[PAD]'})

# Resize the token embeddings with mean_resizing=False
model.resize_token_embeddings(len(tokenizer), mean_resizing=False)

# Set the pad_token_id to the new padding token (e.g., '[PAD]')
model.config.pad_token_id = tokenizer.pad_token_id

# Print pad_token_id and eos_token_id to ensure proper configuration
print("Pad token ID:", tokenizer.pad_token_id)
print("EOS token ID:", tokenizer.eos_token_id)

model.eval()

# Set up Flask app
app = Flask(__name__)

@app.route('/generate', methods=['POST'])
def generate_text():
    # Get the input text from the request
    data = request.json
    input_text = data.get('text', '')
    if not input_text:
        return jsonify({"error": "No text provided"}), 400  # Bad Request

    # Tokenize input with padding, truncation, and attention mask
    inputs = tokenizer(input_text, return_tensors='pt', padding=True, truncation=True, max_length=512)

    # Generate output with attention_mask, explicitly setting pad_token_id
    with torch.no_grad():
        outputs = model.generate(
            inputs['input_ids'],
            attention_mask=inputs['attention_mask'],
            max_length=100,   # Increase max_length
            pad_token_id=tokenizer.pad_token_id,
            eos_token_id=tokenizer.eos_token_id,
            do_sample=True,   # Enable sampling to diversify output
            top_k=50,         # Use top-k sampling
            top_p=0.95,       # Use nucleus sampling
            temperature=0.7   # Control creativity of the output
        )

    # Decode and return output
    generated_text = tokenizer.decode(outputs[0], skip_special_tokens=True)
    return jsonify({"generated_text": generated_text})

if __name__ == '__main__':
    app.run(host='0.0.0.0', port=5000)
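As a side note on the parameters above: in transformers, `max_length` bounds the prompt tokens plus the generated ones together, whereas `max_new_tokens` bounds only the continuation. A minimal sketch of the budget arithmetic (the token counts are hypothetical):

```python
# max_length in generate() covers the prompt and the continuation together.
# Hypothetical token counts, just to show the budget arithmetic:
max_length = 100     # value passed to model.generate above
prompt_tokens = 8    # e.g. len(inputs['input_ids'][0]) for a short question
room_for_answer = max_length - prompt_tokens  # tokens left for new text
```

So a long prompt silently shrinks the space available for the answer; `max_new_tokens` avoids that coupling.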
Request:
curl -X POST http://localhost:5000/generate -H "Content-Type: application/json" -d '{"text": "What time is it in japan"}'
Response:
{"generated_text":"What time is it in japanese?\n\n"}
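For context on the response above: with a decoder-only model like GPT-Neo, `model.generate` returns the prompt token ids followed by the newly generated ids, so decoding `outputs[0]` always echoes the input text at the start. Slicing off the prompt length before decoding isolates just the continuation (a minimal sketch with plain lists standing in for the tensors; the ids are made up):

```python
# generate() output = prompt ids followed by newly generated ids.
# Hypothetical ids for illustration -- real ids come from the tokenizer.
prompt_ids = [1212, 640, 318]              # stands in for inputs['input_ids'][0]
output_ids = prompt_ids + [632, 338, 764]  # stands in for outputs[0]

# Decode only the continuation by skipping the prompt tokens, i.e.
# tokenizer.decode(outputs[0][inputs['input_ids'].shape[1]:])
new_ids = output_ids[len(prompt_ids):]
```

This only removes the echo; it does not by itself make a base language model answer questions rather than continue text.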
Tags: python, GPT-Neo-1.3B not giving answers when called via API, Stack Overflow