Comments (3)
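If the option accepts Perl-compatible regexes, then this is possible and simple:
--replace-message-text 'qr{^.{1000000,1000000000}$}s'
The above will match commit messages whose total length is between 1,000,000 and 1,000,000,000 characters, i.e. roughly 1 MB to 1 GB (the s modifier lets . span newlines, so the pattern covers the whole message rather than one line).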
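As a rough illustration only: Java's java.util.regex does not accept Perl's qr{...}s wrapper, so the same size check would have to be spelled with a flag instead. The sketch below assumes the proposed option would hand its pattern to java.util.regex; the class and method names are hypothetical and not part of The BFG, and the size window is a parameter so it can be tried on small inputs.

```java
import java.util.regex.Pattern;

// Hypothetical helper, not part of The BFG: flags commit messages whose
// length falls inside a size window, mirroring the Perl pattern above.
final class OversizedMessageCheck {

    // Builds a Java counterpart of qr{^.{min,max}$}s.
    // Pattern.DOTALL plays the role of Perl's /s, so '.' also matches newlines.
    static Pattern sizeWindow(int minChars, int maxChars) {
        return Pattern.compile("^.{" + minChars + "," + maxChars + "}$", Pattern.DOTALL);
    }

    // True when the whole message is between minChars and maxChars characters long.
    static boolean isOversized(String message, int minChars, int maxChars) {
        return sizeWindow(minChars, maxChars).matcher(message).matches();
    }

    public static void main(String[] args) {
        String huge = "x".repeat(2_000_000);
        System.out.println(isOversized(huge, 1_000_000, 1_000_000_000));       // true
        System.out.println(isOversized("Fix typo", 1_000_000, 1_000_000_000)); // false
    }
}
```

Note that the window counts characters, not bytes, so the 1 MB and 1 GB figures are only approximate for messages containing non-ASCII text.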
Note that we found another instance where a single short msg line has been
repeated millions of times.
Thanks
Richard
On Mon, Jul 15, 2013 at 12:29:33AM -0700, Roberto Tyley wrote:
@chenri mentions in #20 that his repo contains a very large commit message:
due to someone repeating a short line a huge number of times (editing error?) so that a single msg line was 6Mb long. We never detected this until we tried bfg last week.
It would be nice if The BFG had the ability to somehow reduce the size of large commit messages, but how exactly would it do it? Some options:
• Simply truncate the entire message after the first X KB?
• Allow the user to run a --replace-message-text option, similar to --replace-text? This would work for a repeated value on a single line, so long as the line wasn't too long, but how would it work on a commit message with 1 million lines, each distinct?
from bfg-repo-cleaner.
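The two options floated in the quoted message could be sketched roughly as follows. This is a minimal illustration under stated assumptions, not how The BFG works: the class and method names are hypothetical, truncation is measured in UTF-8 bytes, and the second helper only covers the case where the repeats sit on separate, consecutive lines.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helpers, not part of The BFG.
final class MessageShrinker {

    // Option 1: keep at most maxBytes of the UTF-8 encoded message,
    // backing off so that a multi-byte character is never split.
    static String truncateUtf8(String message, int maxBytes) {
        byte[] bytes = message.getBytes(StandardCharsets.UTF_8);
        if (bytes.length <= maxBytes) {
            return message;
        }
        int end = maxBytes;
        // UTF-8 continuation bytes look like 10xxxxxx; step back to a character start.
        while (end > 0 && (bytes[end] & 0xC0) == 0x80) {
            end--;
        }
        return new String(bytes, 0, end, StandardCharsets.UTF_8);
    }

    // Option 2, narrowed to the repeated-line case: collapse each run of
    // identical consecutive lines (blank lines included) into a single copy.
    static String collapseRepeatedLines(String message) {
        StringBuilder out = new StringBuilder();
        String previous = null;
        for (String line : message.split("\n", -1)) {
            if (line.equals(previous)) {
                continue; // same as the line before it, so drop the repeat
            }
            if (previous != null) {
                out.append('\n');
            }
            out.append(line);
            previous = line;
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(truncateUtf8("hello world", 5));
        System.out.println(collapseRepeatedLines("oops\noops\noops\nreal change"));
    }
}
```

Truncation also answers the "1 million distinct lines" case, since it does not care what the lines contain; collapsing repeats keeps more of the original message but only helps when the bloat is duplication.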
The BFG uses the regular expressions provided by the Java standard library (java.util.regex), which are quite extensive:
http://docs.oracle.com/javase/7/docs/api/java/util/regex/Pattern.html
I'll hopefully get some time in the next 24 hours to look at this.
Hi Roberto,
Did you get a chance to work on this? It would be really
nice if bfg-repo-cleaner could handle huge commit message sizes.
Thanks
Richard
On Tue, Jul 16, 2013 at 02:09:46AM -0700, Roberto Tyley wrote:
The BFG uses regex expressions provided by the Java SDK, which are quite extensive:
http://docs.oracle.com/javase/7/docs/api/java/util/regex/Pattern.html
I'll hopefully get some time in the next 24 hours to look at this.