
getdatav1's Introduction

xmile-getData

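Uses Apache HttpClient 4.4 to simulate HTTP requests and to handle the site's CAPTCHA. Most of the CAPTCHA handling is image processing: a median filter removes noise, the image is cut into equal-width slices, and the result is binarized to pure black and white. The login password is hashed with MD5 and sent as an uppercase hex string.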
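The preprocessing steps and the password hash described above can be sketched as follows. This is a minimal illustration, not the project's actual code: the class name CaptchaSketch, the 3x3 filter window, the threshold passed to binarize, and the order of the steps are all assumptions.

```java
import java.awt.image.BufferedImage;
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.Arrays;

// Hypothetical helper class; names and parameters are illustrative only.
final class CaptchaSketch {

    /** Converts an image to a grayscale matrix indexed as [row][column], values 0..255. */
    static int[][] toGray(BufferedImage img) {
        int h = img.getHeight(), w = img.getWidth();
        int[][] gray = new int[h][w];
        for (int y = 0; y < h; y++) {
            for (int x = 0; x < w; x++) {
                int rgb = img.getRGB(x, y);
                int r = (rgb >> 16) & 0xFF, g = (rgb >> 8) & 0xFF, b = rgb & 0xFF;
                gray[y][x] = (r * 299 + g * 587 + b * 114) / 1000; // integer luma approximation
            }
        }
        return gray;
    }

    /** 3x3 median filter: each interior pixel becomes the median of its neighbourhood. */
    static int[][] medianFilter(int[][] gray) {
        int h = gray.length, w = gray[0].length;
        int[][] out = new int[h][w];
        int[] window = new int[9];
        for (int y = 0; y < h; y++) {
            for (int x = 0; x < w; x++) {
                if (y == 0 || x == 0 || y == h - 1 || x == w - 1) {
                    out[y][x] = gray[y][x]; // border pixels are copied unchanged
                    continue;
                }
                int k = 0;
                for (int dy = -1; dy <= 1; dy++) {
                    for (int dx = -1; dx <= 1; dx++) {
                        window[k++] = gray[y + dy][x + dx];
                    }
                }
                Arrays.sort(window);
                out[y][x] = window[4]; // middle of 9 sorted values
            }
        }
        return out;
    }

    /** Binarizes: true marks a black (foreground) pixel, i.e. a gray value below the threshold. */
    static boolean[][] binarize(int[][] gray, int threshold) {
        boolean[][] bin = new boolean[gray.length][gray[0].length];
        for (int y = 0; y < gray.length; y++) {
            for (int x = 0; x < gray[0].length; x++) {
                bin[y][x] = gray[y][x] < threshold;
            }
        }
        return bin;
    }

    /** Cuts the image into `parts` equal-width slices; leftover columns on the right are dropped. */
    static boolean[][][] splitEvenly(boolean[][] bin, int parts) {
        int h = bin.length, sliceWidth = bin[0].length / parts;
        boolean[][][] slices = new boolean[parts][h][sliceWidth];
        for (int p = 0; p < parts; p++) {
            for (int y = 0; y < h; y++) {
                System.arraycopy(bin[y], p * sliceWidth, slices[p][y], 0, sliceWidth);
            }
        }
        return slices;
    }

    /** MD5 digest of the UTF-8 bytes of `text`, rendered as 32 uppercase hex characters. */
    static String md5Upper(String text) {
        try {
            byte[] digest = MessageDigest.getInstance("MD5").digest(text.getBytes(StandardCharsets.UTF_8));
            StringBuilder sb = new StringBuilder(32);
            for (byte b : digest) {
                sb.append(String.format("%02X", b));
            }
            return sb.toString();
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException("MD5 is required on every Java platform", e);
        }
    }
}
```

A typical chain would be splitEvenly(binarize(medianFilter(toGray(image)), 128), 4), where 128 and 4 are placeholder values for the threshold and the number of CAPTCHA characters.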

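Key issues:
1. Chinese text in the returned JSON was garbled. Fixed by setting the encoding explicitly: String content = EntityUtils.toString(r.getEntity(), "UTF-8");
2. Session ID: login only succeeds once the session ID has been added to the cookie.
3. CAPTCHA recognition reaches about 80% accuracy and needs further work. Unresolved.
4. Every request must have its response code checked, and a failed login requires logging in again. Resolved (an auto-login ID is used instead).
5. Add a Swing UI to collect user input. Resolved; rendering was slow at first, so the date rendering control has since been replaced.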
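The encoding problem in item 1 can be reproduced without any HTTP library. The sketch below decodes the same bytes twice: once as UTF-8 and once as ISO-8859-1, which is the charset HttpClient 4.x falls back to in EntityUtils.toString(entity) when the response declares none. The class name CharsetSketch and the sample JSON are assumptions made for illustration.

```java
import java.nio.charset.Charset;

// Hypothetical demo class; it stands in for decoding a raw HTTP response body.
final class CharsetSketch {

    /** Decodes raw response bytes with the charset the caller chooses. */
    static String decode(byte[] body, Charset charset) {
        return new String(body, charset);
    }

    /** Sample JSON body containing three Chinese characters, written as Unicode escapes. */
    static String sampleJson() {
        return "{\"name\":\"\u9a8c\u8bc1\u7801\"}";
    }
}
```

Encoding sampleJson() to UTF-8 bytes and decoding with StandardCharsets.UTF_8 returns the original text, while decoding the same bytes with StandardCharsets.ISO_8859_1 turns each Chinese character into three unrelated Latin-1 characters, which is the garbling described in item 1.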

Final fixed version (FINAL VERSION)

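  Each statistics run logs in again to obtain a fresh HTTP context, and every URL is then accessed through that context.   Query tasks are submitted as Callables and their results collected through Futures, which improved overall query efficiency: fetching 5,000 records took about 30 minutes single-threaded and now takes about 5-6 minutes. Network speed also limits throughput to some extent.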
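The Future/Callable pattern mentioned above can be sketched with the standard java.util.concurrent API. The class ParallelQuerySketch, the fetchPage stand-in, and the pool size are assumptions; the real project would issue an HTTP request inside each task.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical sketch of submitting queries as Callables and collecting Futures.
final class ParallelQuerySketch {

    /** Stand-in for one HTTP query; a real task would call the remote URL here. */
    static String fetchPage(int page) {
        return "page-" + page;
    }

    /** Runs one task per page on a fixed-size pool and returns results in page order. */
    static List<String> fetchAll(int pages, int threads) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<Future<String>> futures = new ArrayList<>();
            for (int p = 1; p <= pages; p++) {
                final int page = p;
                Callable<String> task = () -> fetchPage(page);
                futures.add(pool.submit(task)); // returns immediately; work runs on the pool
            }
            List<String> results = new ArrayList<>();
            for (Future<String> future : futures) {
                results.add(future.get()); // blocks until this task has finished
            }
            return results;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt(); // preserve the interrupt for callers
            throw new IllegalStateException("interrupted while waiting for queries", e);
        } catch (ExecutionException e) {
            throw new IllegalStateException("a query task failed", e.getCause());
        } finally {
            pool.shutdown(); // stop accepting tasks; already submitted ones still complete
        }
    }
}
```

Because the futures are read in submission order, the results stay aligned with the page numbers even though the tasks may finish out of order.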

getdatav1's People

Contributors

wenthywang


getdatav1's Issues

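Simulated requests can fail with an error

A simulated request may occasionally fail, and such failures are currently neither caught nor logged.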
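One possible way to address this issue is to wrap each request in a helper that catches the exception, logs it, and retries a bounded number of times. The sketch below is only a suggestion under stated assumptions: the class RetrySketch, the method withRetry, and the use of java.util.logging are not taken from the project.

```java
import java.util.concurrent.Callable;
import java.util.logging.Level;
import java.util.logging.Logger;

// Hypothetical helper; not part of the original project.
final class RetrySketch {
    private static final Logger LOG = Logger.getLogger(RetrySketch.class.getName());

    /**
     * Calls `request` up to `maxAttempts` times (must be at least 1).
     * Each failure is logged; the last failure is rethrown wrapped if all attempts fail.
     */
    static <T> T withRetry(Callable<T> request, int maxAttempts) {
        Exception last = null;
        for (int attempt = 1; attempt <= maxAttempts; attempt++) {
            try {
                return request.call();
            } catch (Exception e) {
                last = e;
                LOG.log(Level.WARNING, "request failed, attempt " + attempt + " of " + maxAttempts, e);
            }
        }
        throw new IllegalStateException("request failed after " + maxAttempts + " attempts", last);
    }
}
```

A caller would pass the request as a lambda, for example withRetry(() -> doQuery(url), 3), where doQuery is a placeholder for the code that performs the simulated request.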
