facebook-scraper icon indicating copy to clipboard operation
facebook-scraper copied to clipboard

Stories support?

Open hmijail opened this issue 2 years ago • 1 comments

Just reporting in case this can be fixed.

Links to Stories only get a response redirecting to the m.facebook.com version of the link.

>>> for post in get_posts(post_urls=["https://www.facebook.com/stories/XXX/YYY/?view_single=true"],cookies="from_browser",extra_info=True,options={"comments":  True}, youtube_dl=True):
...     pp.pprint(post)
...

{   'original_request_url': 'https://www.facebook.com/stories/XXX/YYY/?view_single=true',
    'post_url': 'https://m.facebook.com/stories/XXX/YYY/?view_single=true'}

If I retry the request with that link it just repeats the same response.

hmijail avatar Jun 27 '22 06:06 hmijail

Hi - https://github.com/kevinzg/facebook-scraper/commit/2e8b56fb8962247e032a1ed23d8fef1bf408f5fc and https://github.com/kevinzg/facebook-scraper/commit/fe1819e2b782832ddf02e00d281ae1db7d1ac0e9 should make this possible. I tested with 4 public stories, 2 with images and 2 with videos, using this code:


set_cookies("cookies.json")

pprint(
    list(
        get_posts(
            post_urls=[
                "https://m.facebook.com/story/view/?bucket_id=1728197760537301&source=story_tray&_rdr",
                "https://m.facebook.com/stories/view_tray_pagination/618128721880095/?tray_session_id=32f907f4-103c-4f5f-b047-b0bd626ea0f6&thread_id=596727791757667&end_cursor=NzI5NDAyNjE5OjE6MTY1NjM5MjE2MzoyMDgyMjY2ODg4NDU1NjM4Oi0xOmF0bjoxNTMyNTM1MDI2OTUyNjk2NDU4OjA6MTY1NjM5MjE2Mzo%3D&has_next_page=false",
                "https://m.facebook.com/stories/view_tray_pagination/1713994855341221/?tray_session_id=84fa68f2-ea42-46a2-8562-a76e1a484fb7&thread_id=419763783405060&end_cursor=NzI5NDAyNjE5OjE6MTY1NjM5MzUzMToyNzk2NTE3MjMzNzY3MjE2Oi0xOmF0bjo0NzQ5NjA0NjQzNjExMjU3OjA6MTY1NjM5MzUzMTo%3D&has_next_page=false",
                "https://m.facebook.com/stories/view_tray_pagination/170893644397665/?tray_session_id=7c5346ca-877e-45e4-8aec-f6b817cd11f7&thread_id=1215826175836201&end_cursor=NzI5NDAyNjE5OjE6MTY1NjM5MzU4ODoxNDgyNjk3MDE1MTAzNDQ2Oi0xOmF0bjoyNDU1NjM2NTE4MzQxOTIyMTI5OjA6MTY1NjM5MzU4ODo%3D&has_next_page=true"
            ]
        )
    )
)

And these were the results:

[{'available': True,
  'comments': 0,
  'comments_full': None,
  'factcheck': None,
  'image': None,
  'image_id': None,
  'image_ids': [],
  'image_lowquality': 'https://scontent.fakl8-1.fna.fbcdn.net/v/t39.30808-6/290626587_10160382238642915_6004618627465148590_n.jpg?stp=cp0_dst-jpg_e15_fb50_p296x100_q65&_nc_cat=108&ccb=1-7&_nc_sid=5e1285&efg=eyJpIjoidCJ9&_nc_ohc=RL4nDtBwV0EAX9V-UXb&_nc_ht=scontent.fakl8-1.fna&oh=00_AT_Y-FR8mi8lptnmewH2FdjSIrJq18aICL4-F6X67NPi_w&oe=62BFDF56',
  'images': [],
  'images_description': [],
  'images_lowquality': ['https://scontent.fakl8-1.fna.fbcdn.net/v/t39.30808-6/290626587_10160382238642915_6004618627465148590_n.jpg?stp=cp0_dst-jpg_e15_fb50_p296x100_q65&_nc_cat=108&ccb=1-7&_nc_sid=5e1285&efg=eyJpIjoidCJ9&_nc_ohc=RL4nDtBwV0EAX9V-UXb&_nc_ht=scontent.fakl8-1.fna&oh=00_AT_Y-FR8mi8lptnmewH2FdjSIrJq18aICL4-F6X67NPi_w&oe=62BFDF56',
                        'https://scontent.fakl8-1.fna.fbcdn.net/v/t39.30808-6/290626587_10160382238642915_6004618627465148590_n.jpg?stp=cp0_dst-jpg_e15_p320x320_q65&_nc_cat=108&ccb=1-7&_nc_sid=5e1285&efg=eyJpIjoidCJ9&_nc_ohc=RL4nDtBwV0EAX9V-UXb&_nc_ht=scontent.fakl8-1.fna&oh=00_AT9UTqrnYv-pErjcJ-95acWyz2MOO1-w5lgbez-BkytfNg&oe=62BFDF56'],
  'images_lowquality_description': ['Photo', None],
  'is_live': False,
  'likes': 0,
  'link': None,
  'links': [],
  'original_request_url': 'https://m.facebook.com/story/view/?bucket_id=1728197760537301&source=story_tray&_rdr',
  'original_text': None,
  'page_id': None,
  'post_id': None,
  'post_text': '',
  'post_url': 'https://m.facebook.com/story/view/?bucket_id=1728197760537301&source=story_tray&_rdr',
  'reaction_count': None,
  'reactions': None,
  'reactors': None,
  'shared_post_id': None,
  'shared_post_url': None,
  'shared_text': 'Delete photo?',
  'shared_time': None,
  'shared_user_id': None,
  'shared_username': None,
  'sharers': None,
  'shares': 0,
  'text': 'Delete photo?',
  'time': datetime.datetime(2022, 6, 28, 9, 20, 26),
  'timestamp': 1656364826,
  'user_id': None,
  'user_url': 'https://facebook.com/TheEdgeNZ/',
  'username': 'The Edge',
  'video': None,
  'video_duration_seconds': None,
  'video_height': None,
  'video_id': None,
  'video_quality': None,
  'video_size_MB': None,
  'video_thumbnail': None,
  'video_watches': None,
  'video_width': None,
  'w3_fb_url': None,
  'was_live': False,
  'with': None},
 {'available': True,
  'comments': 0,
  'comments_full': None,
  'factcheck': None,
  'image': None,
  'image_id': None,
  'image_ids': [],
  'image_lowquality': 'https://scontent.fakl8-1.fna.fbcdn.net/v/t39.30808-6/289256316_5318064961582860_7230078798085311585_n.jpg?stp=cp0_dst-jpg_e15_fb50_p296x100_q65&_nc_cat=111&ccb=1-7&_nc_sid=5e1285&efg=eyJpIjoidCJ9&_nc_ohc=4XtjPxfsInoAX8BeMGH&_nc_ht=scontent.fakl8-1.fna&oh=00_AT-uS0wUrcHEvKDBuGlqNRB7oD_xEh-Xscm8B3E4eKXtvg&oe=62BFCF66',
  'images': [],
  'images_description': [],
  'images_lowquality': ['https://scontent.fakl8-1.fna.fbcdn.net/v/t39.30808-6/289256316_5318064961582860_7230078798085311585_n.jpg?stp=cp0_dst-jpg_e15_fb50_p296x100_q65&_nc_cat=111&ccb=1-7&_nc_sid=5e1285&efg=eyJpIjoidCJ9&_nc_ohc=4XtjPxfsInoAX8BeMGH&_nc_ht=scontent.fakl8-1.fna&oh=00_AT-uS0wUrcHEvKDBuGlqNRB7oD_xEh-Xscm8B3E4eKXtvg&oe=62BFCF66',
                        'https://scontent.fakl8-1.fna.fbcdn.net/v/t39.30808-6/289256316_5318064961582860_7230078798085311585_n.jpg?stp=cp0_dst-jpg_e15_p320x320_q65&_nc_cat=111&ccb=1-7&_nc_sid=5e1285&efg=eyJpIjoidCJ9&_nc_ohc=4XtjPxfsInoAX8BeMGH&_nc_ht=scontent.fakl8-1.fna&oh=00_AT8Do5TWKIaMqHBDKQwsczVaBySay8nuy_EDdvwEEteD9w&oe=62BFCF66'],
  'images_lowquality_description': ['Photo', None],
  'is_live': False,
  'likes': 0,
  'link': None,
  'links': [],
  'original_request_url': 'https://m.facebook.com/stories/view_tray_pagination/618128721880095/?tray_session_id=32f907f4-103c-4f5f-b047-b0bd626ea0f6&thread_id=596727791757667&end_cursor=NzI5NDAyNjE5OjE6MTY1NjM5MjE2MzoyMDgyMjY2ODg4NDU1NjM4Oi0xOmF0bjoxNTMyNTM1MDI2OTUyNjk2NDU4OjA6MTY1NjM5MjE2Mzo%3D&has_next_page=false',
  'original_text': None,
  'page_id': None,
  'post_id': None,
  'post_text': '',
  'post_url': 'https://m.facebook.com/stories/view_tray_pagination/618128721880095/?tray_session_id=32f907f4-103c-4f5f-b047-b0bd626ea0f6&thread_id=596727791757667&end_cursor=NzI5NDAyNjE5OjE6MTY1NjM5MjE2MzoyMDgyMjY2ODg4NDU1NjM4Oi0xOmF0bjoxNTMyNTM1MDI2OTUyNjk2NDU4OjA6MTY1NjM5MjE2Mzo%3D&has_next_page=false',
  'reaction_count': None,
  'reactions': None,
  'reactors': None,
  'shared_post_id': None,
  'shared_post_url': None,
  'shared_text': 'Delete photo?',
  'shared_time': None,
  'shared_user_id': None,
  'shared_username': None,
  'sharers': None,
  'shares': 0,
  'text': 'Delete photo?',
  'time': datetime.datetime(2022, 6, 27, 18, 0, 11),
  'timestamp': 1656309611,
  'user_id': None,
  'user_url': 'https://facebook.com/hangarbarnz/',
  'username': 'The Hangar',
  'video': None,
  'video_duration_seconds': None,
  'video_height': None,
  'video_id': None,
  'video_quality': None,
  'video_size_MB': None,
  'video_thumbnail': None,
  'video_watches': None,
  'video_width': None,
  'w3_fb_url': None,
  'was_live': False,
  'with': None},
 {'available': True,
  'comments': 0,
  'comments_full': None,
  'factcheck': None,
  'image': None,
  'image_id': None,
  'image_ids': [],
  'image_lowquality': 'https://scontent.fakl8-1.fna.fbcdn.net/v/t51.36329-10/290357691_544036690782844_7886211504004685548_n.jpg?stp=cp0_dst-jpg_e15_fb50_p296x100_q65&_nc_cat=105&ccb=1-7&_nc_sid=08861d&efg=eyJpIjoidCJ9&_nc_ohc=aj5SNVj2PTEAX8m1pAU&tn=iGTasn8pTzY0Dad4&_nc_ht=scontent.fakl8-1.fna&oh=00_AT-tRh5PMJnKkIxq0IoOKnhF5KZKyDZBF4gu-VG7PsOYWw&oe=62BEF241',
  'images': [],
  'images_description': [],
  'images_lowquality': ['https://scontent.fakl8-1.fna.fbcdn.net/v/t51.36329-10/290357691_544036690782844_7886211504004685548_n.jpg?stp=cp0_dst-jpg_e15_fb50_p296x100_q65&_nc_cat=105&ccb=1-7&_nc_sid=08861d&efg=eyJpIjoidCJ9&_nc_ohc=aj5SNVj2PTEAX8m1pAU&tn=iGTasn8pTzY0Dad4&_nc_ht=scontent.fakl8-1.fna&oh=00_AT-tRh5PMJnKkIxq0IoOKnhF5KZKyDZBF4gu-VG7PsOYWw&oe=62BEF241',
                        'https://scontent.fakl8-1.fna.fbcdn.net/v/t51.36329-10/290357691_544036690782844_7886211504004685548_n.jpg?stp=cp0_dst-jpg_e15_p320x320_q65&_nc_cat=105&ccb=1-7&_nc_sid=08861d&efg=eyJpIjoidCJ9&_nc_ohc=aj5SNVj2PTEAX8m1pAU&tn=iGTasn8pTzY0Dad4&_nc_ht=scontent.fakl8-1.fna&oh=00_AT8gxbvwis85hTuxo9bdGCf_SJBl-XuaLRNN9NNXI2YUkQ&oe=62BEF241'],
  'images_lowquality_description': [None, None],
  'is_live': False,
  'likes': 0,
  'link': None,
  'links': [],
  'original_request_url': 'https://m.facebook.com/stories/view_tray_pagination/1713994855341221/?tray_session_id=84fa68f2-ea42-46a2-8562-a76e1a484fb7&thread_id=419763783405060&end_cursor=NzI5NDAyNjE5OjE6MTY1NjM5MzUzMToyNzk2NTE3MjMzNzY3MjE2Oi0xOmF0bjo0NzQ5NjA0NjQzNjExMjU3OjA6MTY1NjM5MzUzMTo%3D&has_next_page=false',
  'original_text': None,
  'page_id': None,
  'post_id': None,
  'post_text': '',
  'post_url': 'https://m.facebook.com/stories/view_tray_pagination/1713994855341221/?tray_session_id=84fa68f2-ea42-46a2-8562-a76e1a484fb7&thread_id=419763783405060&end_cursor=NzI5NDAyNjE5OjE6MTY1NjM5MzUzMToyNzk2NTE3MjMzNzY3MjE2Oi0xOmF0bjo0NzQ5NjA0NjQzNjExMjU3OjA6MTY1NjM5MzUzMTo%3D&has_next_page=false',
  'reaction_count': None,
  'reactions': None,
  'reactors': None,
  'shared_post_id': None,
  'shared_post_url': None,
  'shared_text': 'Delete video?',
  'shared_time': None,
  'shared_user_id': None,
  'shared_username': None,
  'sharers': None,
  'shares': 0,
  'text': 'Delete video?',
  'time': datetime.datetime(2022, 6, 27, 20, 49, 43),
  'timestamp': 1656319783,
  'user_id': None,
  'user_url': 'https://facebook.com/UniofAkl/',
  'username': 'University of Auckland - Waipapa Taumata Rau',
  'video': 'https://video.fakl8-1.fna.fbcdn.net/v/t50.12441-16/290308792_561034728886386_3151359295448096015_n.mp4?_nc_cat=101&ccb=1-7&_nc_sid=985c63&efg=eyJ2ZW5jb2RlX3RhZyI6Inhwdl9oZF9wcm9ncmVzc2l2ZSJ9&_nc_ohc=b9eAx5iUUukAX_ptRE7&_nc_ht=video.fakl8-1.fna&oh=00_AT9WvaAr-kIfx31DgT_qMzhPcNvGZF-Ni_10KuOHwBAvSQ&oe=62BB096D',
  'video_duration_seconds': None,
  'video_height': None,
  'video_id': '1124253208432765',
  'video_quality': None,
  'video_size_MB': None,
  'video_thumbnail': None,
  'video_watches': None,
  'video_width': None,
  'w3_fb_url': None,
  'was_live': False,
  'with': None},
 {'available': True,
  'comments': 0,
  'comments_full': None,
  'factcheck': None,
  'image': None,
  'image_id': None,
  'image_ids': [],
  'image_lowquality': 'https://scontent.fakl8-1.fna.fbcdn.net/v/t51.36329-10/290624820_552960919745621_502266866629628509_n.jpg?stp=cp0_dst-jpg_e15_fb50_p296x100_q65&_nc_cat=110&ccb=1-7&_nc_sid=08861d&efg=eyJpIjoidCJ9&_nc_ohc=wtb4ymkaxSAAX-fgRVx&_nc_ht=scontent.fakl8-1.fna&oh=00_AT9AXs6uMntKbLK2DZ9Qfq0oqhRCThwbHAhrWgwxuKZixg&oe=62BF5B00',
  'images': [],
  'images_description': [],
  'images_lowquality': ['https://scontent.fakl8-1.fna.fbcdn.net/v/t51.36329-10/290624820_552960919745621_502266866629628509_n.jpg?stp=cp0_dst-jpg_e15_fb50_p296x100_q65&_nc_cat=110&ccb=1-7&_nc_sid=08861d&efg=eyJpIjoidCJ9&_nc_ohc=wtb4ymkaxSAAX-fgRVx&_nc_ht=scontent.fakl8-1.fna&oh=00_AT9AXs6uMntKbLK2DZ9Qfq0oqhRCThwbHAhrWgwxuKZixg&oe=62BF5B00',
                        'https://scontent.fakl8-1.fna.fbcdn.net/v/t51.36329-10/290624820_552960919745621_502266866629628509_n.jpg?stp=cp0_dst-jpg_e15_p320x320_q65&_nc_cat=110&ccb=1-7&_nc_sid=08861d&efg=eyJpIjoidCJ9&_nc_ohc=wtb4ymkaxSAAX-fgRVx&_nc_ht=scontent.fakl8-1.fna&oh=00_AT-8Wg5KYcxVkN64Rp-HqU-mOyOr9lXG28EG7PzKS99Ivw&oe=62BF5B00'],
  'images_lowquality_description': [None, None],
  'is_live': False,
  'likes': 0,
  'link': None,
  'links': [],
  'original_request_url': 'https://m.facebook.com/stories/view_tray_pagination/170893644397665/?tray_session_id=7c5346ca-877e-45e4-8aec-f6b817cd11f7&thread_id=1215826175836201&end_cursor=NzI5NDAyNjE5OjE6MTY1NjM5MzU4ODoxNDgyNjk3MDE1MTAzNDQ2Oi0xOmF0bjoyNDU1NjM2NTE4MzQxOTIyMTI5OjA6MTY1NjM5MzU4ODo%3D&has_next_page=true',
  'original_text': None,
  'page_id': None,
  'post_id': None,
  'post_text': '',
  'post_url': 'https://m.facebook.com/stories/view_tray_pagination/170893644397665/?tray_session_id=7c5346ca-877e-45e4-8aec-f6b817cd11f7&thread_id=1215826175836201&end_cursor=NzI5NDAyNjE5OjE6MTY1NjM5MzU4ODoxNDgyNjk3MDE1MTAzNDQ2Oi0xOmF0bjoyNDU1NjM2NTE4MzQxOTIyMTI5OjA6MTY1NjM5MzU4ODo%3D&has_next_page=true',
  'reaction_count': None,
  'reactions': None,
  'reactors': None,
  'shared_post_id': None,
  'shared_post_url': None,
  'shared_text': 'Delete video?',
  'shared_time': None,
  'shared_user_id': None,
  'shared_username': None,
  'sharers': None,
  'shares': 0,
  'text': 'Delete video?',
  'time': datetime.datetime(2022, 6, 27, 23, 16, 48),
  'timestamp': 1656328608,
  'user_id': None,
  'user_url': 'https://facebook.com/pendulum',
  'username': 'Pendulum',
  'video': 'https://video.fakl8-1.fna.fbcdn.net/v/t50.12441-16/289603386_741985486946627_5767020682908648770_n.mp4?_nc_cat=106&ccb=1-7&_nc_sid=985c63&efg=eyJ2ZW5jb2RlX3RhZyI6Inhwdl9oZF9wcm9ncmVzc2l2ZSJ9&_nc_ohc=WgFBXERR6GYAX-fQZAe&_nc_ht=video.fakl8-1.fna&oh=00_AT87tijzGl5L9ICdgPySsnEprox1GaVbNt3a99y_7F_b2w&oe=62BB1E18',
  'video_duration_seconds': None,
  'video_height': None,
  'video_id': '1757740114557438',
  'video_quality': None,
  'video_size_MB': None,
  'video_thumbnail': None,
  'video_watches': None,
  'video_width': None,
  'w3_fb_url': None,
  'was_live': False,
  'with': None}]

It looks like the last image in images_lowquality is the most useful (hint: post["images_lowquality"][-1]). It looks like it's possible to extract exact timestamps (down to the second), user URLs and usernames. video post extraction appears to work fine too. Give it a try and see how you go.

neon-ninja avatar Jun 28 '22 05:06 neon-ninja

Is it possible to automate the process of retrieving the URL of the stories from pages that we like?

Thanks in advance.

jeromelarman avatar Jan 07 '23 22:01 jeromelarman

If I use https://www.facebook.com/stories/{my_identifier]/ URL I can get the first stories ressource, but I can not iterate on each available stories. Does it exist a trick ?

I can confirm that get_posts by url does scrap well the first ressource.

Thanks in advance .

jeromelarman avatar Jan 08 '23 12:01 jeromelarman