Hi, complete newbie here, so bear with me. This seems like a simple job, but I can't find an easy way to do it.

I need to extract a particular piece of text from a webpage, "www.example.com/index.php". I know the text is in a p tag with a certain id. How do I extract this data using JavaScript?

What I'm currently trying: I have a JavaScript file (trying.js) on my computer with the following code:

$(document).ready(function () {
    $.get("www.example.com/index.php", function(data) {
        console.log(data)
    }) ;
});

and an HTML file that runs the JavaScript file.

When I open this HTML page in Firefox, nothing shows up in the console. How do I get the website's data? Am I on the right track here? Is there a better way to do this?

asked Oct 4, 2013 at 13:02 by Vivek
  • 11 You can't. JavaScript has a same-origin policy, so you don't have access to websites other than those on the same domain or services that support JSONP or CORS. – adeneo Commented Oct 4, 2013 at 13:03
  • 1 possible duplicate of Can Javascript read the source of any web page? – Blazemonger Commented Oct 4, 2013 at 13:06
  • 1 You need to write an app, maybe using Selenium or Watin browser automation or my new favorite CSQuery (it has only read access to the DOM but uses jQuery-style filters in CSharp and is really fast). – iCollect.it Ltd Commented Oct 4, 2013 at 13:23
  • What you're looking for is a page scraper. JavaScript in the browser can't pull it off because it can only gather data from the domain you're on. You could build it in Ruby, for example, and use one of the many existing gems for this sort of task, like github.com/assaf/scrapi or nokogiri – Orlando Commented Oct 4, 2013 at 13:34
  • Please take a look at stackoverflow.com/questions/680562/… There are multiple ways discussed. Hope it helps you. – Dropout Commented Oct 4, 2013 at 13:43

1 Answer


This happens because of the same-origin policy, which for security reasons prevents JavaScript in the browser from reading responses from a different origin unless that server opts in via CORS. However, you can work around it in one of two ways:

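1 - Use a proxy server
$(document).ready(function () {
    // The proxy host is a placeholder: it stands for a server you run that fetches the page for you.
    const target = encodeURIComponent("https://www.example.com/index.php");
    $.get("https://your-proxy-server.com/fetch?url=" + target, function (data) {
        // Parse the returned HTML string into a detached document.
        const parser = new DOMParser();
        const doc = parser.parseFromString(data, 'text/html');
        // getElementById returns null when no element has that id.
        const el = doc.getElementById('your-id-here');
        console.log(el ? el.textContent : 'Element not found');
    });
});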
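
The proxy itself is not shown above, so here is a rough sketch of what one could look like, assuming Node.js 18 or later (which provides a global fetch). The /fetch route and url parameter mirror the request in option 1; ALLOWED_HOSTS, parseTarget, handle, and startProxy are hypothetical names chosen for illustration, not part of any library. Note that the proxy has to send an Access-Control-Allow-Origin header itself, otherwise the browser blocks the proxy's response too.

```javascript
// Minimal proxy sketch (assumes Node.js 18+ for the global fetch).
// The names below are illustrative, not an established API.
const http = require('node:http');

// Only proxy hosts you explicitly trust; an open proxy is a security risk.
const ALLOWED_HOSTS = new Set(['www.example.com']);

// Extract and validate the ?url= parameter; returns a URL object or null.
function parseTarget(requestUrl) {
  const { pathname, searchParams } = new URL(requestUrl, 'http://localhost');
  if (pathname !== '/fetch') return null;
  const raw = searchParams.get('url');
  if (!raw) return null;
  let target;
  try {
    target = new URL(raw);
  } catch {
    return null; // not an absolute URL
  }
  const okProtocol = target.protocol === 'http:' || target.protocol === 'https:';
  return okProtocol && ALLOWED_HOSTS.has(target.hostname) ? target : null;
}

// Fetch the target page server-side and relay it with a CORS header.
async function handle(req, res) {
  const target = parseTarget(req.url);
  if (!target) {
    res.writeHead(400, { 'Content-Type': 'text/plain' });
    res.end('Bad or disallowed url parameter');
    return;
  }
  try {
    const upstream = await fetch(target);
    const body = await upstream.text();
    res.writeHead(upstream.status, {
      'Content-Type': 'text/html; charset=utf-8',
      // Lets pages on other origins read this response.
      'Access-Control-Allow-Origin': '*',
    });
    res.end(body);
  } catch (err) {
    res.writeHead(502, { 'Content-Type': 'text/plain' });
    res.end('Upstream request failed');
  }
}

// Call startProxy(3000) to listen on http://localhost:3000.
function startProxy(port) {
  return http.createServer(handle).listen(port);
}
```

With this running locally, the browser code would request http://localhost:3000/fetch?url=... instead of the target site directly. The host allowlist is a deliberate choice: without it, anyone could use the proxy to fetch arbitrary URLs.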

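2 - Use the Fetch API with a server that supports CORS
// Works only if the target server sends an Access-Control-Allow-Origin header that permits your page.
fetch('https://www.example.com/index.php', {
    method: 'GET',
    headers: {
        'Accept': 'text/html'
    }
})
.then(response => {
    // fetch resolves on HTTP error statuses too, so check explicitly.
    if (!response.ok) throw new Error('HTTP ' + response.status);
    return response.text();
})
.then(data => {
    // Parse the HTML string and read the text of the element with the known id.
    const parser = new DOMParser();
    const doc = parser.parseFromString(data, 'text/html');
    const el = doc.getElementById('your-id-here');
    console.log(el ? el.textContent : 'Element not found');
})
.catch(error => console.error('Error:', error));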
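
To make the same-origin rule more concrete, here is a small sketch using the standard URL class. The helper names isSameOrigin and resolveAgainstPage, and the mysite.test address, are made up for illustration. It also shows a second likely problem with the code in the question: a URL written without a scheme is treated as a relative path, so the request never goes to the intended site.

```javascript
// Two URLs share an origin only if scheme, host, and port all match.
// Opaque origins (e.g. file: pages) serialize as "null" and never match.
function isSameOrigin(a, b) {
  const originA = new URL(a).origin;
  const originB = new URL(b).origin;
  return originA !== 'null' && originA === originB;
}

// Resolve a link the way a browser would for a page at pageUrl.
function resolveAgainstPage(href, pageUrl) {
  return new URL(href, pageUrl).href;
}

// Same scheme, host, and (default) port: same origin.
console.log(isSameOrigin('https://www.example.com/a', 'https://www.example.com:443/b'));

// Different scheme: different origin.
console.log(isSameOrigin('https://www.example.com/', 'http://www.example.com/'));

// No scheme, so this resolves to a path on the current site.
console.log(resolveAgainstPage('www.example.com/index.php', 'https://mysite.test/page.html'));
```

The practical takeaway is to always write the full URL, including https://, and to expect a cross-origin read to fail unless the target server sends CORS headers or a proxy is used.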
