Python string manipulation -- performance problems

February 15, 2014

I have the following piece of code that I execute around 2 million times in my application to parse that many records. This part seems to be the bottleneck, and I was wondering whether anyone could suggest some nifty tricks that would make these simple string manipulations faster.

    try:
        data = []
        start = 0
        end = 0
        for info in self.Columns():
            end = start + info.columnLength
            slice = line[start:end]
            # Reject lines that are too short for this column.
            if slice == '' or len(slice) != info.columnLength:
                raise ValueError('Wrong Input')
            # Signed columns must start with '+' or '-'.
            if info.hasSignage:
                if slice[0:1].strip() != '+' and slice[0:1].strip() != '-':
                    raise ValueError('Wrong Input')
            if not info.skipColumn:
                data.append(slice)
            start = end
        parsedLine = data
    except ValueError:
        parsedLine = False

Edit: I'm changing this answer a bit. I will leave the original answer below.

In my other answer, I commented that the best thing would be to find a built-in Python module that does the unpacking. I could not think of one, but perhaps I should have searched Google for one. @John Machin gave an answer that showed how to do it: use the Python struct module. Since it is written in C, it should be faster than my pure Python solution. (I have not actually measured anything, so this is a guess.)

I do agree that the logic in the original code is "un-Pythonic". Returning a sentinel value is not best; it is better to either return a valid value or raise an exception. Another way to do it is to return a list of valid values, plus a second list of invalid values. Since @John Machin offered code that yields up valid values, I thought I would write a version here that returns two lists.

Note: Perhaps the best possible answer would be to take @John Machin's answer and modify it to save the invalid values to a file for possible later review. His answer yields results one at a time, so there is no need to build a large list of parsed records; and saving the bad lines to disk means there is no need to build a possibly large list of bad lines.
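As a quick illustration of why struct fits fixed-width records, here is a minimal sketch. The three-column layout and the sample line are made up for this example and are not taken from the question. In a format string, "s" keeps a fixed-width byte string and "x" skips bytes.

```python
import struct

# Hypothetical layout: a 3-byte field, 2 skipped bytes, then a 4-byte field.
record = struct.Struct("3s2x4s")

# struct works on bytes, so text lines would need to be encoded first.
line = b"abc--+123"
fields = record.unpack(line)

print(record.size)  # total record width in bytes
print(fields)       # one bytes object per kept column
```

Compiling the format once with struct.Struct and reusing it for every line avoids re-parsing the format string on each call.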
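A rough sketch of what that combination might look like follows. This is not @John Machin's code: the Column namedtuple, the function name iter_good_records, the error messages, and the tab-separated log format are all assumptions made for illustration. Lines are assumed to be bytes, because struct unpacks bytes.

```python
import io
import struct
from collections import namedtuple

# Hypothetical stand-in for the objects returned by self.Columns() in the post.
Column = namedtuple("Column", "columnLength hasSignage skipColumn")


def iter_good_records(lines, columns, bad_out):
    """Yield one tuple per valid line; write invalid lines to bad_out."""
    fmt = ""
    sign_checks = []
    start = 0
    for col in columns:
        # "x" skips the column, "s" keeps it as a byte string.
        fmt += "%d%s" % (col.columnLength, "x" if col.skipColumn else "s")
        if col.hasSignage:
            sign_checks.append(start)
        start += col.columnLength
    record = struct.Struct(fmt)

    for line_num, line in enumerate(lines, 1):
        if len(line) != record.size:
            reason = "bad length"
        elif not all(line[i] in b"+-" for i in sign_checks):
            reason = "bad sign"
        else:
            yield record.unpack(line)
            continue
        # Only invalid lines reach this point.
        bad_out.write("%d\t%r\t%s\n" % (line_num, line, reason))


columns = [Column(3, False, False), Column(2, False, True), Column(4, True, False)]
lines = [b"abc--+123", b"short", b"xyz--9123"]
bad_log = io.StringIO()  # a real run would pass an open text file instead
good = list(iter_good_records(lines, columns, bad_log))
```

Because the function is a generator, neither the good records nor the bad lines have to be held in memory all at once; the caller consumes records as they are produced.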

    import struct

    def parse_records(self):
        """
        Returns a tuple: (good, bad)
        good is a list of valid records (as tuples)
        bad is a list of tuples: (line_num, line, err)
        """
        cols = self.Columns()
        unpack_fmt = ""
        sign_checks = []
        start = 0
        # Build the struct format once: "s" keeps a column, "x" skips it.
        for colx, info in enumerate(cols, 1):
            clen = info.columnLength
            if clen < 1:
                raise ValueError("Column %d: Bad columnLength %r" % (colx, clen))
            if info.skipColumn:
                unpack_fmt += str(clen) + "x"
            else:
                unpack_fmt += str(clen) + "s"
            if info.hasSignage:
                sign_checks.append(start)
            start += clen
        expected_len = start
        # Note: struct unpacks byte strings.
        unpack = struct.Struct(unpack_fmt).unpack

        good = []
        bad = []
        for line_num, line in enumerate(self.whatever_the_list_of_lines_is, 1):
            if len(line) != expected_len:
                bad.append((line_num, line, "bad length"))
                continue
            if not all(line[i] in '+-' for i in sign_checks):
                bad.append(( ...

Original answer text:

This answer should be very fast if the self.Columns() information is identical over all records. We process the self.Columns() information exactly once, and build a couple of lists that contain just what we need to process a record.

This code shows how to compute parsedLine, but does not actually yield it, return it, or do anything else with it. Obviously you will need to change that.

    def parse_records(self):
        cols = self.Columns()

        slices = []
        sign_checks = []
        start = 0
        # Precompute the slice boundaries and the sign positions once.
        for info in cols:
            if info.columnLength < 1:
                raise ValueError("bad columnLength")
            end = start + info.columnLength
            if not info.skipColumn:
                tup = (start, end)
                slices.append(tup)
            if info.hasSignage:
                sign_checks.append(start)
            start = end

        expected_len = end  # or use (end - 1)

        for line in self.whatever_the_list_of_lines_is:
            try:
                if len(line) != expected_len:
                    raise ValueError("wrong length")
                if not all(line[i] in '+-' for i in sign_checks):
                    raise ValueError("wrong input")
                parsedLine = [line[s:e] for s, e in slices]
            except ValueError:
                parsedLine = False
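To make the precompute-once idea concrete, here is a small standalone sketch that also returns two lists, as suggested earlier in the post. It is an illustration under assumptions rather than the answer's exact code: the Column namedtuple, the helper names build_plan and parse_lines, the sample data, and the error messages are all hypothetical. Lines are plain strings here.

```python
from collections import namedtuple

# Hypothetical stand-in for the column descriptors used in the post.
Column = namedtuple("Column", "columnLength hasSignage skipColumn")


def build_plan(columns):
    """Walk the column list once; return (slices, sign_checks, expected_len)."""
    slices, sign_checks, start = [], [], 0
    for col in columns:
        if col.columnLength < 1:
            raise ValueError("bad columnLength")
        end = start + col.columnLength
        if not col.skipColumn:
            slices.append((start, end))
        if col.hasSignage:
            sign_checks.append(start)
        start = end
    return slices, sign_checks, start


def parse_lines(lines, columns):
    """Return (good, bad): parsed records and (line_num, line, error) tuples."""
    slices, sign_checks, expected_len = build_plan(columns)
    good, bad = [], []
    for line_num, line in enumerate(lines, 1):
        if len(line) != expected_len:
            bad.append((line_num, line, "bad length"))
        elif not all(line[i] in "+-" for i in sign_checks):
            bad.append((line_num, line, "bad sign"))
        else:
            good.append([line[s:e] for s, e in slices])
    return good, bad


columns = [Column(3, False, False), Column(2, False, True), Column(4, True, False)]
good, bad = parse_lines(["abc--+123", "short", "xyz--9123"], columns)
```

The per-line work is reduced to one length comparison, one pass over the sign positions, and one list comprehension over precomputed slice bounds; none of the column attributes are consulted inside the hot loop.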



















