Pay for PDF processing? No way
2022-07-29 10:06:00 【Seven step programming】
DevWeekly collects and curates high-quality developer content every week, including open source projects, tools and resources, and technical articles. It is published first every Friday on DevWeekly. Stars and bookmarks are welcome!
Original article: DevWeekly | Issue 2: 10 best practices for refactoring code
DevWeekly is published first on GitHub every Friday. Stars and bookmarks are welcome!
Contributions are welcome: submit an issue to recommend (or self-recommend) open source projects, resources, tools, or articles.
How to subscribe:
- Star and bookmark the DevWeekly project
- Follow on Zhihu: Jackpop
Open source projects
1.dot
Python Star:2.6k
dot (Deepfake Offensive Toolkit) is a tool for applying deepfakes to online video in a real-time, controllable way.
So, what are deepfakes?
Translated literally, the term means "deep forgery". In other words, artificial intelligence is used to graft the facial features of one person onto another, changing the face that appears in the video.
2.codecat
Python Star:241

CodeCat is an open source tool that uses static code analysis to help you find and trace user input and security vulnerabilities, which is very valuable for security review during development.
It currently supports C, C++, Go, Python, JavaScript, Swift, PHP, Ruby, ASP, Kotlin, Dart, and Java.
In summary, it has the following features:
- Recursive code search with custom regex-based rules
- Syntax highlighting in the code view
- A rule management system
- Access control for user resources
- An IP address allow list to control access to HTTP resources
- Follows OWASP application security practices
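The idea of recursive, regex-rule-based code search can be sketched in a few lines of Python. This is only an illustration of the technique, not CodeCat's implementation: the `Rule` class, the two rules, and the function names are hypothetical.

```python
import re
from dataclasses import dataclass
from pathlib import Path


@dataclass(frozen=True)
class Rule:
    name: str
    pattern: re.Pattern


# Hypothetical rules for illustration only; not CodeCat's rule set.
RULES = [
    Rule("python-eval", re.compile(r"\beval\s*\(")),
    Rule("php-user-input", re.compile(r"\$_(GET|POST|REQUEST)\b")),
]


def scan_text(path, text, rules=RULES):
    """Return (path, line_number, rule_name) for every line a rule matches."""
    findings = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        for rule in rules:
            if rule.pattern.search(line):
                findings.append((path, lineno, rule.name))
    return findings


def scan_tree(root, rules=RULES):
    """Recursively scan every file under root with the given rules."""
    findings = []
    for path in sorted(Path(root).rglob("*")):
        if path.is_file():
            text = path.read_text(errors="ignore")
            findings.extend(scan_text(str(path), text, rules))
    return findings
```

For example, `scan_text("a.py", "y = eval(user_input)")` reports a `python-eval` finding on line 1. A real tool would add per-language rules and a UI on top of this loop.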
3.OCRmyPDF
Python Star:6.8k
PDFs come in many different types. Those exported directly from Word and similar formats can be copied and pasted directly, which is simple. Scanned PDFs, however, are more troublesome.
OCRmyPDF adds an OCR text layer to scanned PDF files so that they can be searched or copied and pasted.
It has the following features:
- Generates a searchable PDF/A file from a regular PDF
- Places the OCR text accurately below the image to make copy and paste easy
- Keeps the exact resolution of the original embedded images
- Optimizes PDF images, so the generated file is usually smaller than the input file
- Uses the Tesseract OCR engine, which can recognize more than 100 languages
- Can handle documents with thousands of pages
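As a rough sketch of how this might be scripted, the snippet below builds an `ocrmypdf` command line and runs it with `subprocess`. It assumes the `ocrmypdf` CLI and the relevant Tesseract language packs are installed; the helper names and the file names are hypothetical.

```python
import subprocess


def build_ocrmypdf_command(src, dst, languages=("eng",)):
    """Build an argv list for the ocrmypdf CLI (helper name is hypothetical)."""
    return [
        "ocrmypdf",
        "--language", "+".join(languages),  # several languages are joined with "+"
        "--output-type", "pdfa",            # request PDF/A output
        src,
        dst,
    ]


def add_text_layer(src, dst, languages=("eng",)):
    """Run ocrmypdf; raises CalledProcessError if the tool exits with an error."""
    subprocess.run(build_ocrmypdf_command(src, dst, languages), check=True)
```

Calling `add_text_layer("scan.pdf", "searchable.pdf", ("eng", "chi_sim"))` would then write a searchable copy next to the original scan.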
4.data-diff
Python Star:1.3k
data-diff is a command-line tool and Python library for comparing the differences between two different databases.
It has been verified against many different types of databases, and the comparison results can be consumed through the CLI or API to build monitoring and alerting. It is compatible with differing field types, and it is also very fast: it can verify 25M+ rows in about 10 seconds.
data-diff splits the table into smaller segments and then checksums each segment in both databases. When the checksums of a segment are not equal, it divides that segment into smaller segments and checks those, repeating until it reaches the rows that differ.
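The segment-and-checksum idea described above can be sketched with two in-memory tables. This is an illustration of the technique under simplifying assumptions (integer primary keys, tables held as Python dicts), not data-diff's actual code; in a real database the checksums would be computed by the database itself, and all function names here are hypothetical.

```python
import hashlib

_MISSING = object()  # marks a key that is absent from one side


def checksum(rows):
    """Checksum an ordered list of (key, value) pairs."""
    digest = hashlib.sha256()
    for key, value in rows:
        digest.update(repr((key, value)).encode())
    return digest.hexdigest()


def segment(table, lo, hi):
    """Rows of `table` whose key falls in [lo, hi), ordered by key."""
    return [(k, table[k]) for k in sorted(table) if lo <= k < hi]


def diff_keys(a, b, lo, hi, min_size=2):
    """Return the keys in [lo, hi) whose rows differ between tables a and b."""
    rows_a, rows_b = segment(a, lo, hi), segment(b, lo, hi)
    if checksum(rows_a) == checksum(rows_b):
        return []  # identical segment: nothing to inspect
    if hi - lo <= min_size:
        # Segment is small enough: compare rows one by one.
        da, db = dict(rows_a), dict(rows_b)
        return sorted(
            k for k in da.keys() | db.keys()
            if da.get(k, _MISSING) != db.get(k, _MISSING)
        )
    mid = (lo + hi) // 2  # split and recurse only into mismatching halves
    return diff_keys(a, b, lo, mid, min_size) + diff_keys(a, b, mid, hi, min_size)
```

Because matching segments are discarded after a single checksum comparison, only the parts of the key range that actually contain differences are explored down to row level.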
5.cfg4j
Java Star:541
A modern configuration library for distributed applications written in Java.
cfg4j, short for "configuration for Java", is an open source configuration library for distributed applications developed in Java.
It has the following features:
- Automatic configuration reloading
- Powerful configuration mechanisms (interface binding, multi-source support with fallback strategies, merging, and more)
- Distributed-environment friendly (caching, support for multiple environments [testing, pre-production, production], ...)
- Reads configuration from Consul, Git repos (YAML and/or properties), files, the classpath, ...
- Seamless integration with DI containers: Spring, Guice, and others
- Integration with the Metrics library to expose performance metrics
- Extensively tested (99% code coverage)
- Complete documentation
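The "multi-source support with fallback" idea from the list above can be illustrated with Python's standard `collections.ChainMap`. cfg4j itself is a Java library, and this sketch does not use or mirror its API; the source names and keys below are hypothetical.

```python
from collections import ChainMap

# Three hypothetical configuration sources, from lowest to highest priority.
defaults = {"db.host": "localhost", "db.port": "5432", "cache.enabled": "false"}
file_source = {"db.host": "db.internal", "cache.enabled": "true"}
env_source = {"db.port": "6432"}


def merged_config(*sources):
    """Look keys up in the first source that defines them; later sources are fallbacks."""
    return ChainMap(*sources)


config = merged_config(env_source, file_source, defaults)
```

Here `config["db.port"]` comes from `env_source`, `config["db.host"]` from `file_source`, and any key missing from both falls back to `defaults`. Because `ChainMap` keeps references to the underlying dicts, updating a source is visible on the next lookup, which loosely mirrors the reloading behavior described above.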
6.Decompiler
Kotlin Star:58

A free, open source GUI tool for decompiling Java.
As the name suggests, this tool lets us edit compiled Java JAR packages, decompile complex Java applets and binary files, and generate accurate source code.
7.scrutiny
Go Star:1.5k

A web UI for smartd S.M.A.R.T monitoring.
If you run a server with several hard disks, you are probably already familiar with S.M.A.R.T and the smartd daemon.
smartd is a system daemon that monitors the Self-Monitoring, Analysis and Reporting Technology (SMART) system built into many ATA, IDE, and SCSI-3 hard disks.
The purpose of SMART is to monitor hard disk reliability, predict hard disk failures, and run different types of hard disk self-tests.
Scrutiny is a simple, centralized application with several core features:
- Web UI dashboard
- smartd integration
- Automatic detection of all connected hard disks
- S.M.A.R.T metric tracking for historical trends
- Customized thresholds based on real-world failure rates
- Temperature tracking
8.sake
Go Star:435
sake is a command runner for local and remote hosts.
You define servers and tasks in a sake.yaml configuration file, and then you can run the tasks on the servers.
sake has many features:
- Auto-completion of tasks, servers, and tags
- SSH into servers or docker containers
- List servers/tasks via sake list servers|tasks
- Display the results of sake run in tabular format
9.codewarrior
C Star:170
An open source code search and static code analysis tool with the following features:
- Code search by matching code patterns
- Custom rules
- HTTPd view with web socket resources
- Recursive scanning
- Syntax highlighting per programming language
- An allow list to control which IP addresses can access resources
10.wayfire
C++ Star:1.5k
Wayfire is a 3D Wayland compositor, inspired by Compiz and based on wlroots.
It aims to create a customizable, extendable, and lightweight environment without sacrificing its appearance.
Tools and resources
1.Pomsky
Pomsky is a language that compiles to regular expressions. It is currently in the alpha stage.
Pomsky can be used through the CLI or a Rust macro.
Some examples (Pomsky on the left, the compiled regular expression in the comment on the right):
# String
'hello world' # hello world
# Greedy repetition
'hello'{1,5} # (?:hello){1,5}
'hello'* # (?:hello)*
'hello'+ # (?:hello)+
# Lazy repetition
'hello'{1,5} lazy # (?:hello){1,5}?
'hello'* lazy # (?:hello)*?
'hello'+ lazy # (?:hello)+?
# Alternation
'hello' | 'world' # hello|world
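To see what the greedy and lazy variants above mean in practice, the compiled regular expressions from the right-hand column can be tried directly with Python's `re` module. This only exercises the regex output, not Pomsky itself; the sample text is made up.

```python
import re

text = "hellohellohello"

# Greedy repetition takes as many repetitions as it can.
greedy = re.match(r"(?:hello){1,5}", text)
# Lazy repetition stops at the minimum that still matches.
lazy = re.match(r"(?:hello){1,5}?", text)
# Alternation matches either branch.
either = re.fullmatch(r"hello|world", "world")

print(greedy.group())  # hellohellohello
print(lazy.group())    # hello
print(either.group())  # world
```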
2.GameShell
A game for learning how to use Unix shell commands.
Learning the Unix shell comes at a cost. GameShell is designed as a tool to help students get started with the shell, using a game to make learning both encouraging and fun.

3.Commands.dev
Every engineer has forgotten a terminal command, either because it is rarely used or because it takes complex parameters that are hard to understand.
When that happens, they have to dig through their terminal command history or open a browser and search.
Commands.dev was built to solve this problem.
Commands.dev is an open source, searchable collection of popular terminal commands gathered from across the Internet. You can search for commands by category (git, android, docker), and each command tells you which parameters to fill in.

4.PyScript

PyScript is a way to run Python in the browser.
It is a framework that lets users create rich Python applications using an HTML interface together with Pyodide, WASM, and more.
The PyScript framework gives users of all experience levels access to an easy-to-learn programming language with a wide range of applications.
5.Hedy

Learning to program can be difficult. It is not that programming itself is hard, but that there are many rules to remember, just like when you start learning English.
Likewise, learning to program takes a lot of practice.
Hedy is a gradual programming language. That means you don't have to learn all the rules at once: the first few levels have few rules, so you can ease into programming.
Each level adds new rules and more commands, so you learn step by step.
Technical articles
1. Data statistics and analysis with Java and Python
When data statistics and analysis come up, the first thing many people think of is Python or R.
In fact, Java is also widely used in the data field. Flink and Hive, for example, both have good Java support.
This article focuses on how to use Java and Python to run simple analyses on large amounts of tabular data and compute some statistics.
In it, you can see the different data analysis techniques available on each platform, compare how they scale, and explore applying parallel computing to improve performance.
2. 10 best practices for refactoring code
Writing code is not just about getting it to run correctly and produce output. You also have to consider design patterns, extensibility, readability, and so on.
Refactoring is the process of improving the design of existing code without changing its functionality.
As software developers, we constantly face the need to improve and optimize our code.
Whether the goal is performance, readability, or maintainability, refactoring is a fundamental skill.
There are many different techniques available when refactoring code.
This article introduces 10 best practices for refactoring code.
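As a small illustration of the definition above (improving design without changing behavior), here is a hypothetical before-and-after example of an "extract function" refactoring. The invoice scenario, the names, and the 10% tax rate are made up for illustration and are not taken from the linked article.

```python
TAX_PERCENT = 10  # hypothetical tax rate, kept as an integer to avoid float rounding


def invoice_total_before(items):
    """Original version: pricing logic is duplicated inside the loop."""
    total = 0
    for item in items:
        if item["taxable"]:
            total += item["price"] * item["qty"] * (100 + TAX_PERCENT) // 100
        else:
            total += item["price"] * item["qty"]
    return total


def line_total(item):
    """Extracted helper: price of a single line, including tax if applicable."""
    subtotal = item["price"] * item["qty"]
    if item["taxable"]:
        return subtotal * (100 + TAX_PERCENT) // 100
    return subtotal


def invoice_total_after(items):
    """Refactored version: same result, with the per-line logic named and reusable."""
    return sum(line_total(item) for item in items)
```

Both functions return the same total for any input, which is exactly what makes the change a refactoring rather than a behavior change; a quick assertion comparing the two is a cheap safety net while restructuring.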