Honors 1000, Productive Data Manipulation in Python and SQL (Fall 2018)
Project - U.S. Congress Data (Due: Sunday, December 9, 2018)
This is a group project; Ideally 2 students per group. Please make one submission per group. Put down names of group members in the program files as comments.Consider the data avaliable at the following websites; Write Python/BeautifulSoup programs to retrieve this data and populate MySQL tables defined below:
https://www.senate.gov/senators/ https://infogalactic.com/info/List_of_current_United_States_Senators_by_age - THIS DOES NOT WORK; PERMISSION DENIED USE THIS INSTEAD: https://ipfs.io/ipfs/QmXoypizjW3WknFiJnKLwHCnL72vedxjQkDDP1mXWo6uco/wiki/List_of_current_United_States_senators_by_age.html create table senator fname lname state party age gender https://www.house.gov/representatives create table representative fname lname state district party genderTo obtain the Gender of senators/congresspeople, you can use the following API:
https://genderize.io/ from gender import getGenders getGenders(['Dianne','Corey','Kamala']) [('female', 1, 416), ('male', 0.91, 638), ('female', 0.88, 16)] getGenders(['Kim']) [('female', 0.88, 3561)]
NOTE: If you do find better websites for similar data, please use those. If you can retrieve other data about the senators or congresspeople, you may do so.
handin submission under "project"
sudo handin1000 project file-name