Having a bit of a headache with a scraping scenario I'm trying in Google Sheets.
In a nutshell, we want to use Google Sheets with ImportXML to create scraped feed from clients' websites pulling product details.
Here is a link to the smaller version of the doc. https://docs.google.com/a/sprt.co.za/spreadsheets/d/1dSbglYniWa_cijb6yDty576j33CTk9Cf8J38a3VXHSU/edit?usp=sharing
Currently this specific client only has the Item Price, etc details in a text area in the code. So when I use =ImportXml($C$2, "//textarea") it gives me the entire text area across two cells. From these cells, actually only the second one I need to pull out details but I am pretty stuck on the Regex on a piece if data this big.
" { ""id"": ""061013AACI9"", ""productId"": ""061013AACI9"", ""name"": ""VANS MEN'S
PERFORATED LEATHER ERA"", ""price"": ""R 799.00"", ""oldPrice"": """", ""brand"":
""Vans"", ""brandURL"": ""/plp/vans/_/N-1z140je"", ""defaultImages"": [ ],
""images"": [ { ""thumb"":
""http://tfgsrv.wigroup.co/06/Thumbnail/31460739.jpg"", ""large"":
""http://tfgsrv.wigroup.co/06/Detail/31460739.jpg"" } , { ""thumb"":
""http://tfgsrv.wigroup.co/06/ThumbnailAlternative/31460739_01.jpg"",
""large"": ""http://tfgsrv.wigroup.co/06/DetailAlternative/31460739_01.jpg"" }
, { ""thumb"":
""http://tfgsrv.wigroup.co/06/ThumbnailAlternative/31460739_02.jpg"",
""large"": ""http://tfgsrv.wigroup.co/06/DetailAlternative/31460739_02.jpg"" }
, { ""thumb"":
""http://tfgsrv.wigroup.co/06/ThumbnailAlternative/31460739_03.jpg"",
""large"": ""http://tfgsrv.wigroup.co/06/DetailAlternative/31460739_03.jpg"" }
], ""transientProfile"": ""true"", ""wishListId"": ""anonymous"", ""colors"": [ {
""id"": ""31460739"", ""name"": ""White"", ""path"":
""http://tfgsrv.wigroup.co/06/ColourSwatch/31460739_SW.jpg"", ""activeColor"" :
true, ""available"" : true } ], ""sizes"": [ { ""id"": ""31460740_06"", ""name"":
""6"", ""available"": false } , { ""id"": ""31460741_06"", ""name"": ""7"",
""available"": true } , { ""id"": ""31460742_06"", ""name"": ""8"", ""available"": true
} , { ""id"": ""31460743_06"", ""name"": ""9"", ""available"": false } , { ""id"":
""31460744_06"", ""name"": ""10"", ""available"": true } , { ""id"": ""31460745_06"",
""name"": ""11"", ""available"": false } ], ""productType"" : ""ColourSize"" } "
I need to pull out the R 799.00 value from that mess. So if anyone is willing to help out. Because frankly my talent and skill has run it's course in trying to navigate that with RegEx.
Collected from the Internet
Please contact [email protected] to delete if infringement.
Comments