Skip to content Skip to sidebar Skip to footer

Reading Data From Specially Formatted Text File

I am using this method, kindly suggested by Ashwini Chaudhary, to assign data to a dictionary from a text file that is in a specific format. keys = map(str.strip, next(f).split('Ke

Solution 1:

Something like this:

import re
withopen('abc') as f:
    for line in f:
        if line.startswith('Key'):
            keys = re.search(r'Key\s+(.*)',line).group(1).split("\t")
        elif line.startswith(('Word','Letter')):
            vals = re.search(r'(Word|Letter)\s+(.*)',line).group(2).split("\t")

    printdict(zip(keys,vals))

abc:

LineHere  w    x    y    z
Key       a1b2  c 3  d 4
OrHere    00011011
Word      as   box  cow  dig

output is :

{'d 4': 'dig', 'b 2': 'box', 'a 1': 'as', 'c 3': 'cow'}

abc:

LineHere  w    x    y    z
Key       a1b2  c 3  d 4
OrHere    00011011
Letter    AB    C    D

output is :

{'d 4': 'D', 'b 2': 'B', 'a 1': 'A', 'c 3': 'C'}

Solution 2:

ss = '''LineHere  w    x    y    z
Key       a 1  b 2  c 3  d 4
OrHere    00   01   10   11
Word      as   box  cow  dig
'''import re

rgx = re.compile('Key +(.*)\r?\n''(?:.*\r?\n)?''(?:Word|Letter) +(.*)\r?\n')

mat = rgx.search(ss)
keys = mat.group(1).split(' ')
words = mat.group(2).split('\t')

You'll obtain ss by reading your file:

withopen (filename) as f:
    ss = f.read()

Edit

Well, if all the lines have data separated with tabs, you can do:

ss = '''LineHere  w\tx\ty\tz
Key       a 1\tb 2\tc 3\td 4
OrHere    00\t01\t10\t11
Word      as\tbox\tcow\tdig
'''import re

rgx = re.compile('Key +(.*)\r?\n''(?:.*\r?\n)?''(?:Word|Letter) +(.*)\r?\n')

printdict(zip(*map(lambda x: x.split('\t'),
                     rgx.search(ss).groups())))

Post a Comment for "Reading Data From Specially Formatted Text File"