GENSCAN 1.0 Date run: 4-Nov-116 Time: 06:01:40 Sequence gi568815597r:19820310_20023559 : 203250 bp : 43.48% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 20224 20263 40 -2.66 1.01 Init + 24849 24909 61 2 1 93 76 11 0.417 1.81 1.02 Intr + 40183 40257 75 2 0 83 89 44 0.204 3.49 1.03 Term + 52096 52232 137 2 2 77 42 153 0.445 7.78 1.04 PlyA + 54540 54545 6 1.05 2.00 Prom + 56273 56312 40 -5.86 2.01 Init + 62205 62425 221 2 2 65 76 454 0.754 40.00 2.02 Intr + 70076 70224 149 2 2 68 78 96 0.691 6.48 2.03 Intr + 74059 74171 113 2 2 71 96 66 0.993 5.80 2.04 Intr + 77231 77353 123 1 0 108 76 185 0.983 20.08 2.05 Intr + 83964 84089 126 2 0 73 13 122 0.596 4.08 2.06 Intr + 84582 84678 97 2 1 70 92 81 0.987 6.28 2.07 Intr + 86123 86307 185 0 2 36 95 83 0.904 3.31 2.08 Term + 87261 87437 177 2 0 108 41 230 0.950 17.99 2.09 PlyA + 90103 90108 6 1.05 3.05 PlyA - 90343 90338 6 1.05 3.04 Term - 100140 99998 143 1 2 72 54 240 0.986 17.09 3.03 Intr - 102095 101989 107 1 2 68 78 136 0.850 10.46 3.02 Intr - 102446 102308 139 0 1 142 81 288 0.999 33.02 3.01 Init - 103250 103211 40 2 1 41 106 50 0.886 0.67 3.00 Prom - 104170 104131 40 -6.66 4.00 Prom + 104267 104306 40 -5.46 4.01 Init + 105043 105101 59 0 2 39 105 2 0.772 -2.22 4.02 Term + 106093 106294 202 1 1 59 50 383 0.831 28.46 4.03 PlyA + 107765 107770 6 1.05 5.13 PlyA - 108673 108668 6 1.05 5.12 Term - 110584 110475 110 2 2 31 47 98 0.646 -1.33 5.11 Intr - 112795 112705 91 1 1 48 101 171 0.716 13.97 5.10 Intr - 115106 115038 69 2 0 63 80 52 0.321 1.28 5.09 Intr - 132160 132070 91 0 1 74 89 53 0.293 4.00 5.08 Intr - 133969 133797 173 1 2 50 21 111 0.062 -0.66 5.07 Intr - 154117 153999 119 2 2 87 75 13 0.249 0.08 5.06 Intr - 154672 154631 42 2 0 92 81 23 0.334 0.21 5.05 Intr - 157812 157706 107 2 2 39 83 87 0.739 3.16 5.04 Intr - 158215 158071 145 2 1 87 85 151 0.982 14.04 5.03 Intr - 161604 161573 32 2 2 118 41 12 0.091 -2.73 5.02 Intr - 163580 163387 194 2 2 -12 81 150 0.219 2.59 5.01 Init - 164301 164140 162 0 0 88 80 149 0.628 11.88 5.00 Prom - 164819 164780 40 -3.06 6.02 PlyA - 164908 164903 6 1.05 6.01 Sngl - 172784 172269 516 2 0 21 43 209 0.496 5.74 6.00 Prom - 172919 172880 40 -4.96 7.00 Prom + 178292 178331 40 -2.26 7.01 Init + 186297 186497 201 2 0 60 86 95 0.089 5.38 7.02 Term + 192344 192526 183 1 0 39 38 123 0.215 -0.06 7.03 PlyA + 198318 198323 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 177765 178022 258 2 0 72 53 165 0.889 6.53 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:19820310_20023559|GENSCAN_predicted_peptide_1|90_aa MTERGGKKDTMFSKCDTEFKEKRREQQWEELAPFMAIWVVEQVSPGWGPGTGADDAVVNN EFTQDDDDCKYCGPLGSLAQTPNKDIAMEL >gi568815597r:19820310_20023559|GENSCAN_predicted_CDS_1|273_bp atgactgaaaggggaggaaagaaagacacaatgttttccaaatgtgatacagaattcaag gaaaagagaagagagcaacagtgggaagaacttgctccattcatggccatctgggtggtg gagcaggtgtctccagggtggggtcctggcacaggtgctgatgatgcagttgtaaataat gagtttacacaggatgacgatgactgcaaatactgtgggccattaggatctctggcacag acaccaaacaaagacattgctatggagctgtaa >gi568815597r:19820310_20023559|GENSCAN_predicted_peptide_2|396_aa MSRKQAAKSRPGSGSRKAEAERKRDERAARRALAKERRNRPESGGGGGCEEEFVSFANQL QALGLKLREVPGDGNCLFRALGDQLEGHSRNHLKHRQETVDYMIKQREDFEPFVEDDIPF EKHVASLAKPGTFAGNDAIVAFARNHQLNVVIHQLNAPLWQIRGTEKSSVRELHIAYRYG EHYDSVRRINDNSEAPAHLQTDMLHQDESNKREKIKTKGMDSEDDLRDEVEDAVQKVCNA TGCSDFNLIVQNLEAENYNIESAIIAVLRMNQGKRNNAEENLEPSGRVLKQCGPLWEEGG SGARIFGNQGLNEGRTENNKAQASPSEENKANKNQLAKVTNKQRREQQWMEKKKRQEERH RHKALESRGSHRDNNRSEAEANTQVTLVKTFAALNI >gi568815597r:19820310_20023559|GENSCAN_predicted_CDS_2|1191_bp atgtcccgaaagcaggcggcgaagagccggccgggcagcggcagccggaaagccgaggcc gagcgcaagcgggacgagcgggcggcgcgccgggccctggccaaggagcggcggaatcgg ccggagtctggcggcggcggcggctgcgaggaggagttcgtcagcttcgccaaccagctg caggccctggggctgaagctgcgggaggtgccgggggacggcaattgcttgttcagagct cttggtgatcaattggagggacactcacgaaatcatctcaagcacagacaggagacagtg gactacatgataaagcagcgggaagattttgaaccctttgtagaagatgacattcctttt gagaagcatgtggccagtttggcaaagcctggtacttttgctggcaatgatgcaattgta gcctttgcaagaaatcatcagttgaatgtagtgattcatcaacttaatgcccctttgtgg cagattcgtggtacagagaaaagcagcgtgagggagttacacatcgcatatcggtatgga gagcactacgacagtgttcggaggatcaatgacaactcagaggcacctgcacatctccag acggatatgcttcatcaagatgaatcaaataaaagagaaaagatcaagacaaagggaatg gactctgaagacgacctgagagatgaagtagaggatgctgtccagaaagtttgtaatgca actggatgttcagattttaatttaatagtccagaacctggaagctgaaaattataatatt gaatctgcaataattgccgtgcttcggatgaaccaagggaagagaaataatgcagaagag aatcttgagcccagtggtcgagtgctgaagcagtgtggccctttgtgggaggagggtggc agtggtgccagaatctttggaaatcagggcttaaatgaaggcaggaccgaaaacaataag gcacaggccagccctagtgaagaaaacaaagcaaataaaaaccagctcgcaaaggtcaca aacaaacagaggcgagaacagcagtggatggagaagaagaagcggcaggaggagaggcac cgccacaaagccctggagagcagaggtagccacagggacaataacagaagcgaagcagag gcgaacacgcaggtcaccttggtgaagaccttcgccgctctcaacatctga >gi568815597r:19820310_20023559|GENSCAN_predicted_peptide_3|142_aa MKSPHVLVFLCLLVALVTGNLVQFGVMIEKMTGKSALQYNDYGCYCGIGGSHWPVDQTDW CCHAHDCCYGRLEKLGCEPKLEKYLFSVSERGIFCAGRTTCQRLTCECDKRAALCFRRNL GTYNRKYAHYPNKLCTGPTPPC >gi568815597r:19820310_20023559|GENSCAN_predicted_CDS_3|429_bp atgaaatctccccacgtgctggtgttcctttgcctcctggtggctctggtcaccgggaac ctggttcagtttggggtgatgatcgagaagatgacaggcaagtccgccctgcagtacaac gactatggctgttactgcggcatcggtggctcccactggccggtggaccagactgactgg tgctgccacgcccacgactgctgctacgggcgtctggagaagctgggctgtgagcccaaa ctggaaaagtatcttttctctgtcagcgaacgtggcattttctgcgccggcaggaccacc tgccagcggctgacctgcgagtgtgacaagagggctgccctctgctttcgccgcaacctg ggcacctacaaccgcaaatatgcccattatcccaacaagctgtgcaccgggcccaccccg ccctgctga >gi568815597r:19820310_20023559|GENSCAN_predicted_peptide_4|86_aa MLGSSHPVSEKTCGEKGSESLEATTIIIIIITISIIIVNITIITITIIVITINITIIITI AITITIFITINIIYMVFICVPTQISH >gi568815597r:19820310_20023559|GENSCAN_predicted_CDS_4|261_bp atgctgggctcttctcaccctgtttcagagaagacttgcggggagaaggggtcagagagc ttagaagccaccaccatcatcatcatcataatcactatcagtatcatcatcgtcaatatt accatcatcaccatcaccatcattgtcatcaccatcaatattaccatcatcatcaccatt gccatcaccatcaccatcttcatcaccatcaatatcatttatatggtttttatctgtgtc cctacccaaatctcacattga >gi568815597r:19820310_20023559|GENSCAN_predicted_peptide_5|444_aa MPEPPLSAAVGSCAARASPTSAIPCSMAPSPIDHPRAEECGTQRSTCRQLHLRQVGSFTP EPVRPRTHQKEETPNTSEHQKEQTPDTPPLRTVTLTVRDRGFILEVSETKNPPIPVTVCV TTYHMMSFTSLLQAHGNLVNFHRMIKLTTGKEAALSYGFYGCHCGVGGRGSPKDATDRCC VTHDCCYKRLEKRGCGTKFLSYKFSNSGSRITCGNSSSYWPAGRKQEGFRWEIPTWTSYS PTCFSNNTHLGCWVWIPSQRHPAQGTQMVKEVDLRLSFHLLGCSTRLNPSSLAILVISVI GFLFGEQQDLHQTPGVSVTEFLTVWQWFSNLSKHRNYTEPCKKYFQGPIPGVSNSAPCIW TGIVTCFDQMDVAKVLLQDNDDDGGGDDDDDGGGVSGNDCDGDDSDSNVRQGPNEPYPDF IACLKDAAQKAILDAHVRETIVQL >gi568815597r:19820310_20023559|GENSCAN_predicted_CDS_5|1335_bp atgcctgagcctcccctctccgccgccgtgggctcctgcgcggcccgagcctcccccacg agcgccatcccctgctccatggcgcccagtcccattgaccacccaagggctgaggagtgt ggcacacagcgcagcacttgcaggcagctccacctgcggcaggtcggcagcttcactcct gagccagtgagaccacgaacccaccagaaggaagaaactccaaacacatctgaacatcag aaggaacaaactccggacacgccgcctttaagaactgtaacactcactgtgagggaccgc ggcttcattcttgaagtcagtgagaccaagaacccaccaattccggtcacagtatgtgtc accacctatcacatgatgtcatttactagcctactgcaggcccatgggaatttggtgaat ttccacagaatgatcaagttgacgacaggaaaggaagccgcactcagttatggcttctac ggctgccactgtggcgtgggtggcagaggatcccccaaggatgcaacggatcgctgctgt gtcactcatgactgttgctacaaacgtctggagaaacgtggatgtggcaccaaatttctg agctacaagtttagcaactcggggagcagaatcacctgtggcaatagcagctcctactgg cctgctgggagaaagcaggaagggttccgctgggaaattcccacctggaccagctacagc cccacctgtttctccaataacacgcatctgggctgctgggtgtggatcccttcacaaagg caccctgcacaggggacacagatggtcaaggaggtggatttgagactgagttttcatctc cttggctgcagcacccgattaaatccttcttccttggcaatacttgtcatctcagtgatt ggctttctgtttggcgagcagcaggacctacatcaaacccctggtgtttcagtaacagaa tttctaacagtgtggcagtggttctccaacttgagcaagcatcggaattacacggagcct tgtaaaaaatatttccagggcccaatccctggagtttctaattcagctccttgcatctgg actggcattgtaacttgctttgatcaaatggatgtggcaaaagtgctgctgcaggataat gatgatgatggtggtggtgatgatgatgatgatggtggtggtgttagtggtaatgattgt gatggtgatgacagtgatagtaatgtcagacagggccctaatgaaccttacccagacttc attgcctgcctaaaagatgcagctcaaaaggctatcttggacgcacatgtccgagagaca attgtccaactatag >gi568815597r:19820310_20023559|GENSCAN_predicted_peptide_6|171_aa MRQKIKKDIQDLNAALDQEDLIDIYRTLHHKSRDYTFFSVPHSTYSKIDHIIGNKTLLSR CKRIEIITVSLSDHSAIKLELRIKKLSQNCTTTWKLNNLLLNDYWVTNKIKAEITKFFET SEKKETTYQNLWDTFKAVCKGKFVTLNAHIRKQETSKIGTLILQLKELEKQ >gi568815597r:19820310_20023559|GENSCAN_predicted_CDS_6|516_bp atgagacagaaaattaaaaaggacattcaggacttgaacgcagctctggaccaagaggac ctaatagatatctacagaaccctccaccacaaatcaagagactatacgttcttctcagta ccacatagcacttattctaaaatcgaccacataattggaaataaaacactcctcagcaga tgcaaaagaatagaaatcataacagtcagcctctcagaccatagtgcaataaaattagaa ctaaggattaagaaactcagtcaaaactgcacaactacatggaaactgaacaaccttctc ctgaatgactactgggtaaccaacaaaattaaagcagaaataacgaagttctttgaaacc agtgagaaaaaagagacaacataccagaatctctgggacacatttaaagcagtgtgtaaa ggcaaatttgtaacactaaatgcccacatcagaaagcaggaaacatctaaaattggcacc cttatattacaattaaaagaactagagaagcaatag >gi568815597r:19820310_20023559|GENSCAN_predicted_peptide_7|127_aa MSELLFTIATKRIKYPGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGIMNIM KMAIMPKFCGLWGDPTVMVPPDILMESLCGSLDAMAPLGIALVEAVCSGPTPVAVLFLGP KALWDIL >gi568815597r:19820310_20023559|GENSCAN_predicted_CDS_7|384_bp atgagtgaactcctattcacaattgctacaaagagaataaaatacccaggaatccaactt acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaacgaaataaaa gaggacacaaataaatggaagaacattccatgctcatggataggaataatgaatatcatg aaaatggccataatgcccaagttttgtggtctttggggtgaccccactgtcatggttcca ccagacatccttatggagtctctctgtggcagtcttgatgccatggccccactgggcatt gccttagtggaggctgtctgcagcggccccacccctgtggcagttctcttcctgggccct aaggctctctgggacatactttga