I am pretty new to programming and kindly ask for your help.
I am currently analyzing travel restrictions imposed by countries during the COVID outbreak. For the analysis I need to download a summary table from an online Power BI dashboard, but it has no download option and no static URL. The table is in the 'travel restriction' section of the website.
The data is loaded as you scroll the page, so the whole dataset is only accessible after scrolling for quite a long time. I tried a simple 'select all' (Ctrl+A) to copy at least part of the dataset, but that doesn't work either: it copies just one row.
Any ideas how to scrape the dataset? I would greatly appreciate any tips or solutions.
On this same page there is the DataSet:
Click on this and it brings you to an Excel download of the data. There you will find the restriction data you are looking for (at least this is what I assume).
I'm working on an online learning platform. I've used ng2-pdf-viewer to display my lessons, which are in PDF format. I'm looking for a way to create a sidebar menu in which the user can find all the different chapters of the lesson.
I've thought about using page jumps to go from one chapter to another when the user clicks its name in the side menu, but that would be very inconvenient for me because there will be quite a lot of lessons and PDF files. I was looking into how the table of contents that you can find in any PDF file is made, but I didn't get anywhere. Any suggestions?
Angular 2 + IntelliJ
I am new to web scraping, and so far I only know how to scrape basic HTML pages using Python's Beautiful Soup. What I want is to extract the information on this page. Specifically, I would like to get the following data for all the fellows (around 700 of them):
name
background
insight project
current employer
However, that page is rendered by JavaScript, and the desired information only shows up in a separate box when a mouseover event is triggered on each fellow's picture.
How can I extract the text in this case? Any information (books, web resources) is appreciated. Python solutions are preferred if possible. Many thanks.
Check the page source of the website.
The information is already present in the DOM, just hidden using CSS. At first glance, it seems that the JavaScript logic is only doing CSS manipulations.
The fact that the information is hidden by CSS will not prevent you from scraping it from the source using a web scraping tool.
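To illustrate why CSS hiding does not get in the way, here is a minimal sketch that uses only Python's standard-library `html.parser`. The markup, the class names (`fellow`, `tooltip`, `name`, `employer`) and the `extract_fellows` helper are all hypothetical; the real page will have its own structure, which you would read off the page source. Beautiful Soup could be used in much the same way.

```python
from html.parser import HTMLParser

# Hypothetical markup: the real page's tags and class names will differ.
SAMPLE_HTML = """
<div class="fellow">
  <img src="a.jpg">
  <div class="tooltip" style="display:none">
    <span class="name">Ada Example</span>
    <span class="employer">Example Corp</span>
  </div>
</div>
<div class="fellow">
  <img src="b.jpg">
  <div class="tooltip" style="display:none">
    <span class="name">Bob Sample</span>
    <span class="employer">Sample Inc</span>
  </div>
</div>
"""


class FellowParser(HTMLParser):
    """Collect text from <span> elements whose class is listed in FIELDS."""

    FIELDS = {"name", "employer"}

    def __init__(self):
        super().__init__()
        self.fellows = []   # one dict per fellow
        self._field = None  # field currently being read, if any

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        if tag == "div" and "fellow" in classes:
            # A new fellow block starts here.
            self.fellows.append({})
        elif tag == "span" and self.fellows:
            # Remember which field this span holds (or None if irrelevant).
            self._field = next((c for c in classes if c in self.FIELDS), None)

    def handle_data(self, data):
        # The parser sees the text even though the style says display:none.
        if self._field and data.strip():
            self.fellows[-1][self._field] = data.strip()

    def handle_endtag(self, tag):
        if tag == "span":
            self._field = None


def extract_fellows(html):
    parser = FellowParser()
    parser.feed(html)
    return parser.fellows
```

Calling `extract_fellows(SAMPLE_HTML)` returns one dictionary per fellow. This only works if the text really is in the HTML the server sends; if it is fetched later by JavaScript, it will not be there to parse.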
I want to let a user save a pre-defined part of a website to their computer, probably in PDF format, with some different CSS styles applied.
I have done the same for printing, but I couldn't find any examples for saving.
How would I go about doing this?
The specific parts are in their respective divs and ids.
Thanks
You'll need a library to do this.
Depending on the language you're using, there are a lot of possibilities. Even Google Chrome itself can print HTML to PDF.
Give us more details about your project.
Ok so we have to implement an online e-reader using HTML5, JS and CSS. So far I have a pretty good understanding of those 3 technologies. What basically needs to happen is that a user picks a book from a gallery of links which takes him to a page to read the book.
Apparently the epub file should be loaded into memory on the client's computer, unpacked and then displayed on the webpage. Up until this point I don't have too many problems; what really racks my brain is what comes next.
So all the chapters are divided into separate xhtml pages, as well as the cover, title page and table of contents, which makes creating navigable dynamic links to them not too hard. But we also have to provide controls to change the font and colour and to navigate between pages.
How do I implement this?
Does the JS to do this come in the page that loads the epub files and if so, how do I write JS that changes properties of a page loaded in the page the JS is in?
I can't think of another way to do this except to put a small JS widget on every single chapter, but then the problem of "turning pages" still exists as I do not know where one page starts and the other ends as this isn't specified in the xhtml files.
I'm sorry if I'm unclear or did not provide enough information. Although I have worked with JS, HTML5 and CSS a lot, the whole epub thing is pretty new to me.
Any help would be appreciated.
I would check out one of the few open source projects dedicated to epubs on the web to get you started. You seem to be asking many questions so rather than answer specifically, I recommend you look at how some others have tackled the problems.
First you should check out Readium: http://readium.org/ They have many of the functions you've described (changing the font, picking a book from a library, etc.)
Next I would look at my personal project https://github.com/fchasen/epub.js
We do things a little differently, but again it should be a good place to look at how we've handled some challenges, so you can come back with more specific questions.
I have been developing an Android epub reader application with which I can read epubs page by page, even if the font size has changed. All I need to do is paginate an HTML page so that I can give the user a book-like reading experience. That HTML page will contain images.
Currently I have tried to implement it by columnizing the HTML page
using the following CSS style.
String style = "<style type=\"text/css\">body {width:" + displayWidth + "px; "
        + "height:" + displayHeight + "px; -webkit-column-gap:" + pagegap + "px; "
        + "-webkit-column-width:" + width + "px; -webkit-column-height:"
        + displayHeight + "px; text-align:justify;} </style>";
But the problem with this approach is that some images are shown split across different
columns, and the right and bottom of paragraphs get clipped. It also has some performance problems.
Is there a better way to do pagination, or at least, how can I solve this image-splitting problem?
Any informative links and hints are welcome.
Thanks for your time.
Regards,
Kariyachan
If you can query the file you want to take the data from multiple times, so that each query returns one particular set of data, then you can implement paging by querying the file for a new set every time the user turns a page.
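The idea of fetching one chunk per page turn can be sketched as follows. This is a language-neutral illustration written in Python rather than Android code, and it assumes the simplest possible case: a plain text file where a "page" is a fixed number of lines. The `read_page` function and its parameters are hypothetical names, and real epub content would need a layout-aware notion of a page.

```python
from itertools import islice


def read_page(path, page, page_size=20):
    """Return the lines of the given zero-based page.

    Only the lines up to the end of the requested page are read,
    so the whole file is never held in memory at once.
    """
    start = page * page_size
    with open(path, encoding="utf-8") as f:
        # islice skips the first `start` lines and stops after `page_size` more.
        return [line.rstrip("\n") for line in islice(f, start, start + page_size)]
```

Each page turn then becomes a call such as `read_page("book.txt", current_page + 1)`. A page past the end of the file comes back as an empty list, which can serve as the end-of-book signal.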