python - Trying to web scrape S&P500 data from Yahoo Finance but cannot retrieve despite all right format

admin管理员组
文章数量:1024592

I have been attempting to web scrape data from yahoo finance specifically on historical data of S&P 500 which its webpage url is '/%5EGSPC/history/?period1=1574074965&period2=1731927744'

As you can see from the picture below, it seems that yahoo finance is not providing access to my act of web scraping. Is there any other solution to overcome this and web scrape the data

S&P 500 data

I have been attempting to web scrape data from yahoo finance specifically on historical data of S&P 500 which its webpage url is 'https://finance.yahoo/quote/%5EGSPC/history/?period1=1574074965&period2=1731927744'

As you can see from the picture below, it seems that yahoo finance is not providing access to my act of web scraping. Is there any other solution to overcome this and web scrape the data

S&P 500 data

Share Improve this question asked Nov 18, 2024 at 11:29 jumbo 111 bronze badge

Can you confirm whether you want the Table of content available in the URL you provided? – Moses01 Commented Nov 18, 2024 at 11:59
Please make sure to post code and console output as code-fenced text, not images. Images can't be copy-pasted, load slower, take more bandwidth, and are less accessible. – Anerdw Commented Nov 19, 2024 at 0:21

Add a comment |

1 Answer 1

Sorted by: Reset to default 2

Since the URL you use is Yahoo Finance and it is redirecting to multiple sites and fetching the data, but the beautifulsoup you use can try fetch up to 30 redirects only.

Instead of using Web Scrap, You can use Yahoo Finance module for Python. I noticed, You wanted to fetch the data from Nov 18,2019 to Nov 18,2024

So please use this code below to get the required data. You can change the dates as per your wish or use the below line to get all data

data = sp500.history(period="max")

Here is the code you should use:

import yfinance as yf
ticker = "^GSPC"
data = yf.Ticker(ticker)
hist = data.history(start="2019-11-18", end="2024-11-18")  # Specify date range
hist.to_csv("sp500_data.csv")

I have been attempting to web scrape data from yahoo finance specifically on historical data of S&P 500 which its webpage url is '/%5EGSPC/history/?period1=1574074965&period2=1731927744'

As you can see from the picture below, it seems that yahoo finance is not providing access to my act of web scraping. Is there any other solution to overcome this and web scrape the data

S&P 500 data

As you can see from the picture below, it seems that yahoo finance is not providing access to my act of web scraping. Is there any other solution to overcome this and web scrape the data

S&P 500 data

Share Improve this question asked Nov 18, 2024 at 11:29 jumbo 111 bronze badge

Can you confirm whether you want the Table of content available in the URL you provided? – Moses01 Commented Nov 18, 2024 at 11:59
Please make sure to post code and console output as code-fenced text, not images. Images can't be copy-pasted, load slower, take more bandwidth, and are less accessible. – Anerdw Commented Nov 19, 2024 at 0:21

Add a comment |

1 Answer 1

Sorted by: Reset to default 2

Since the URL you use is Yahoo Finance and it is redirecting to multiple sites and fetching the data, but the beautifulsoup you use can try fetch up to 30 redirects only.

Instead of using Web Scrap, You can use Yahoo Finance module for Python. I noticed, You wanted to fetch the data from Nov 18,2019 to Nov 18,2024

So please use this code below to get the required data. You can change the dates as per your wish or use the below line to get all data

data = sp500.history(period="max")

Here is the code you should use:

import yfinance as yf
ticker = "^GSPC"
data = yf.Ticker(ticker)
hist = data.history(start="2019-11-18", end="2024-11-18")  # Specify date range
hist.to_csv("sp500_data.csv")

本文标签：

版权声明：本文标题：python - Trying to web scrape S&P500 data from Yahoo Finance but cannot retrieve despite all right format - Stack Overfl 内容由热心网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：http://it.en369.cn/questions/1745621614a2159606.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

369IT编程

python - Trying to web scrape S&amp;P500 data from Yahoo Finance but cannot retrieve despite all right format - Stack Overfl

1 Answer 1

1 Answer 1

更多相关文章

javascript - SVG progress bar with image - Stack Overflow

node.js - Why multer.diskStorage() is not called? - Stack Overflow

javascript - Viewer.JS File Name from Title Attribute - Stack Overflow

math - JavaScript for Random Numbers with Recursion - Stack Overflow

syntax - Resolving Several Errors for a Line of Code in GAMS software - Stack Overflow

woocommerce offtopic - Custom access given to Admin dashboard

javascript - Writing physics simulation apps - Stack Overflow

javascript - cannot set property &#39;exports&#39; of undefined - Stack Overflow

javascript - Insane amount of RAM usage with BullMQ + fetch - Stack Overflow

javascript - Expire my link after X hour nodejs - Stack Overflow

c# - set razor @Html.Hidden within javascript function - Stack Overflow

php - How to prevent access to certain URL requested pages? - Stack Overflow

oauth - Domain verification for allizom.org in the client id authorized redirect urls for Firefox extensions - Stack Overflow

html - Javascript disable space scrolling - Stack Overflow

css - How to fix shifting header after installing speedcache plugins?

javascript - Angularjs ng-repeat: iterate over a special object fields - Stack Overflow

javascript - Executing a &lt;script&gt; tag on append after the DOM has loaded - Stack Overflow

javascript - writeFileSync doesn&#39;t callback - Stack Overflow

javascript - fill an array with getElementById - Stack Overflow

Need Help Moving Onclick Event To External Javascript To Comply With Content Security Policy - Stack Overflow

发表评论

推荐文章

How can one azure application mean for Teams tab be used by multiple domains - Stack Overflow

javascript - jQuery update (1.7) breaks event coords on touch event? - Stack Overflow

javascript - Download and save file in browser&#39;s local storage - Stack Overflow

javascript - Encrypt and decrypt a video with WebCrypto API using AES and a custom key - Stack Overflow

How can I not disable my theme when I want to upload a new version of it?

热门文章

javascript - Durandal widgets, dynamic templated parts - Stack Overflow

javascript - How can i remove bullet from dragable element? - Stack Overflow

List pages by custom field?

javascript - use react hook form with custom TextInput - Stack Overflow

javascript - Set star value in form after click on star - Stack Overflow

javascript - Jquery each sort by value - Stack Overflow

How to re-enable a filter after disabling with __return_false

javascript - Page uploads file twice? - Stack Overflow

Export All Posts and Media to XML andor Word

javascript - Set CSS counter-increment via jQuery - Stack Overflow

最新文章

windows设置断电重启开机后自动输入锁屏密码登录

Windows系统设置开机默认开启数字小键盘

Windows11 开机自动同步时间（开机时间不更新问题）

windows配置开机自启动软件或脚本

【Redis】Windows设置Redis为开机自启动

程序员刚毕业，先去大厂镀金还是先去小厂攒经验？

万象2008清空boss账户密码

【Tools】GitBook简明教程

oracle exadata celldisk 闪存盘受损导致性能下降

SDUT 2138 图结构练习——BFSDFS——判断可达性

sql - PostgreSQL CREATE USER - need for single quotes around password, no quotes for username - Stack Overflow

errors - Can&#39;t login to wordpress, got ERR_EMPTY_RESPONSE after a few minutes

JavaScript - SQL Reporting Services - Stack Overflow

javascript - Adding a callback to a jquery widget - Stack Overflow

reactjs - Request status changed with tab change - Stack Overflow

python - Trying to web scrape S&P500 data from Yahoo Finance but cannot retrieve despite all right format - Stack Overfl

javascript - cannot set property 'exports' of undefined - Stack Overflow

javascript - Executing a <script> tag on append after the DOM has loaded - Stack Overflow

javascript - writeFileSync doesn't callback - Stack Overflow

javascript - Download and save file in browser's local storage - Stack Overflow

errors - Can't login to wordpress, got ERR_EMPTY_RESPONSE after a few minutes