我需要从这些超链接背后抓取一些数据本网站 http://www.echemportal.org/echemportal/propertysearch/treeselect_input.action?queryID=PROQ3h3n。然而,这些超链接是javascript function calls
,稍后提交form
using post
方法。经过一番搜索,selenium
似乎是一个候选人。所以我的问题是,我应该如何正确设置输入标签的值并提交不提交按钮的表单。
from selenium import webdriver
url = "http://www.echemportal.org/echemportal/propertysearch/treeselect_input.action?queryID=PROQ3h3n"
driver = webdriver.Firefox()
driver.get(url)
treePath_tag = driver.find_element_by_name("treePath")
在提交表单之前,我需要为标签赋值<input>
。但是,我收到了一个错误
消息:元素当前不可见,因此可能无法交互
和
treePath_tag.send_keys('/TR.SE00.00/QU.SE.DATA_ENV/QU.SE.ENV_ENVIRONMENT_DATA/QU.SE.EN_MONITORING')
如果上述正确,我想以这种方式提交表格。这是对的吗?
selenium.find_element_by_name("add_form").submit()
以下是网页来源。
JavaScript 函数
<script type="text/javascript">
function AddBlock(path){
document.add_form.treePath.value=path;
document.add_form.submit();
}
</script>
表单“add_form”
<form id="addblock_input" name="add_form" action="/echemportal/propertysearch/addblock_input.action" method="post" style="display:none;">
<table class="wwFormTable" style="display:none;"><tr style="display:none;">
<td colspan="2">
<input type="hidden" name="queryID" value="PROQ3h1w" id="addblock_input_queryID"/> </td>
</tr>
<tr style="display:none;">
<td colspan="2">
<input type="hidden" name="treePath" value="" id="addblock_input_treePath"/> </td>
</tr>
</table></form>
div 与 javascript 调用
<div id="querytree">
<h1>Property Search</h1>
<h2>Select Query Block Type</h2>
<p>Select a section for which to define query criteria.</p>
<div class="queryblocktools"><a href="javascript:document.load_form.submit();"><img style="vertical-align:top;" alt="Load" src="/echemportal/etc/img/load.gif"/> Load Query</a></div>
<ul class="listexpander">
<li>Physical and chemical properties<ul>
<li><a href="javascript:AddBlock('/TR.SE00.00/QU.SE.DATA_PHYS/QU.SE.PC_MELTING');">Melting point/freezing point</a></li>
<li><a href="javascript:AddBlock('/TR.SE00.00/QU.SE.DATA_PHYS/QU.SE.PC_BOILING');">Boiling point</a></li>
</ul>
</div>