GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:08:53

Sequence gi568815597r:19875704_20078773 : 203070 bp : 43.64% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   6811   7031  221  0  2   65   76   454 0.708  40.00
 1.02 Intr +  14682  14830  149  0  2   68   78    96 0.690   6.48
 1.03 Intr +  18665  18777  113  0  2   71   96    66 0.993   5.80
 1.04 Intr +  21837  21959  123  2  0  108   76   185 0.983  20.08
 1.05 Intr +  28570  28695  126  0  0   73   13   122 0.596   4.08
 1.06 Intr +  29188  29284   97  0  1   70   92    81 0.987   6.28
 1.07 Intr +  30729  30913  185  1  2   36   95    83 0.904   3.31
 1.08 Term +  31867  32043  177  0  0  108   41   230 0.950  17.99
 1.09 PlyA +  34709  34714    6                               1.05

 2.05 PlyA -  34949  34944    6                               1.05
 2.04 Term -  44746  44604  143  2  2   72   54   240 0.986  17.09
 2.03 Intr -  46701  46595  107  2  2   68   78   136 0.850  10.46
 2.02 Intr -  47052  46914  139  1  1  142   81   288 0.999  33.02
 2.01 Init -  47856  47817   40  0  1   41  106    50 0.886   0.67
 2.00 Prom -  48776  48737   40                              -6.66

 3.00 Prom +  48873  48912   40                              -5.46
 3.01 Init +  49649  49707   59  1  2   39  105     2 0.772  -2.22
 3.02 Term +  50699  50900  202  2  1   59   50   383 0.831  28.46
 3.03 PlyA +  52371  52376    6                               1.05

 4.13 PlyA -  53279  53274    6                               1.05
 4.12 Term -  55190  55081  110  0  2   31   47    98 0.646  -1.33
 4.11 Intr -  57401  57311   91  2  1   48  101   171 0.716  13.97
 4.10 Intr -  59712  59644   69  0  0   63   80    52 0.321   1.28
 4.09 Intr -  76766  76676   91  1  1   74   89    53 0.293   4.00
 4.08 Intr -  78575  78403  173  2  2   50   21   111 0.062  -0.66
 4.07 Intr -  98723  98605  119  0  2   87   75    13 0.249   0.08
 4.06 Intr -  99278  99237   42  0  0   92   81    23 0.334   0.21
 4.05 Intr - 102418 102312  107  0  2   39   83    87 0.739   3.16
 4.04 Intr - 102821 102677  145  0  1   87   85   151 0.982  14.04
 4.03 Intr - 106210 106179   32  0  2  118   41    12 0.091  -2.73
 4.02 Intr - 108186 107993  194  0  2  -12   81   150 0.219   2.59
 4.01 Init - 108907 108746  162  1  0   88   80   149 0.628  11.88
 4.00 Prom - 109425 109386   40                              -3.06

 5.02 PlyA - 109514 109509    6                               1.05
 5.01 Sngl - 117390 116875  516  0  0   21   43   209 0.496   5.74
 5.00 Prom - 117525 117486   40                              -4.96

 6.00 Prom + 122898 122937   40                              -2.26
 6.01 Init + 130903 131103  201  0  0   60   86    95 0.091   5.38
 6.02 Term + 136950 137132  183  2  0   39   38   123 0.222  -0.06
 6.03 PlyA + 142924 142929    6                               1.05

 7.03 PlyA - 143621 143616    6                               1.05
 7.02 Term - 149191 149015  177  1  0   37   49   133 0.659   1.99
 7.01 Init - 151557 151498   60  0  0   74   82    30 0.510   2.25
 7.00 Prom - 153817 153778   40                               0.34

 8.00 Prom + 177692 177731   40                              -2.56
 8.01 Init + 198320 198362   43  1  1   72   69    58 0.254   2.88
 8.02 Term + 201058 201173  116  2  2   86   33   108 0.324   3.83
 8.03 PlyA + 201216 201221    6                               1.05

 9.02 PlyA - 201385 201380    6                              -1.75
 9.01 Term - 202282 202099  184  0  1   72   49   154 0.739   6.92

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl + 122371 122628  258  0  0   72   53   165 0.889   6.53

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:19875704_20078773|GENSCAN_predicted_peptide_1|396_aa
MSRKQAAKSRPGSGSRKAEAERKRDERAARRALAKERRNRPESGGGGGCEEEFVSFANQL
QALGLKLREVPGDGNCLFRALGDQLEGHSRNHLKHRQETVDYMIKQREDFEPFVEDDIPF
EKHVASLAKPGTFAGNDAIVAFARNHQLNVVIHQLNAPLWQIRGTEKSSVRELHIAYRYG
EHYDSVRRINDNSEAPAHLQTDMLHQDESNKREKIKTKGMDSEDDLRDEVEDAVQKVCNA
TGCSDFNLIVQNLEAENYNIESAIIAVLRMNQGKRNNAEENLEPSGRVLKQCGPLWEEGG
SGARIFGNQGLNEGRTENNKAQASPSEENKANKNQLAKVTNKQRREQQWMEKKKRQEERH
RHKALESRGSHRDNNRSEAEANTQVTLVKTFAALNI
>gi568815597r:19875704_20078773|GENSCAN_predicted_CDS_1|1191_bp
atgtcccgaaagcaggcggcgaagagccggccgggcagcggcagccggaaagccgaggcc
gagcgcaagcgggacgagcgggcggcgcgccgggccctggccaaggagcggcggaatcgg
ccggagtctggcggcggcggcggctgcgaggaggagttcgtcagcttcgccaaccagctg
caggccctggggctgaagctgcgggaggtgccgggggacggcaattgcttgttcagagct
cttggtgatcaattggagggacactcacgaaatcatctcaagcacagacaggagacagtg
gactacatgataaagcagcgggaagattttgaaccctttgtagaagatgacattcctttt
gagaagcatgtggccagtttggcaaagcctggtacttttgctggcaatgatgcaattgta
gcctttgcaagaaatcatcagttgaatgtagtgattcatcaacttaatgcccctttgtgg
cagattcgtggtacagagaaaagcagcgtgagggagttacacatcgcatatcggtatgga
gagcactacgacagtgttcggaggatcaatgacaactcagaggcacctgcacatctccag
acggatatgcttcatcaagatgaatcaaataaaagagaaaagatcaagacaaagggaatg
gactctgaagacgacctgagagatgaagtagaggatgctgtccagaaagtttgtaatgca
actggatgttcagattttaatttaatagtccagaacctggaagctgaaaattataatatt
gaatctgcaataattgccgtgcttcggatgaaccaagggaagagaaataatgcagaagag
aatcttgagcccagtggtcgagtgctgaagcagtgtggccctttgtgggaggagggtggc
agtggtgccagaatctttggaaatcagggcttaaatgaaggcaggaccgaaaacaataag
gcacaggccagccctagtgaagaaaacaaagcaaataaaaaccagctcgcaaaggtcaca
aacaaacagaggcgagaacagcagtggatggagaagaagaagcggcaggaggagaggcac
cgccacaaagccctggagagcagaggtagccacagggacaataacagaagcgaagcagag
gcgaacacgcaggtcaccttggtgaagaccttcgccgctctcaacatctga
>gi568815597r:19875704_20078773|GENSCAN_predicted_peptide_2|142_aa
MKSPHVLVFLCLLVALVTGNLVQFGVMIEKMTGKSALQYNDYGCYCGIGGSHWPVDQTDW
CCHAHDCCYGRLEKLGCEPKLEKYLFSVSERGIFCAGRTTCQRLTCECDKRAALCFRRNL
GTYNRKYAHYPNKLCTGPTPPC
>gi568815597r:19875704_20078773|GENSCAN_predicted_CDS_2|429_bp
atgaaatctccccacgtgctggtgttcctttgcctcctggtggctctggtcaccgggaac
ctggttcagtttggggtgatgatcgagaagatgacaggcaagtccgccctgcagtacaac
gactatggctgttactgcggcatcggtggctcccactggccggtggaccagactgactgg
tgctgccacgcccacgactgctgctacgggcgtctggagaagctgggctgtgagcccaaa
ctggaaaagtatcttttctctgtcagcgaacgtggcattttctgcgccggcaggaccacc
tgccagcggctgacctgcgagtgtgacaagagggctgccctctgctttcgccgcaacctg
ggcacctacaaccgcaaatatgcccattatcccaacaagctgtgcaccgggcccaccccg
ccctgctga
>gi568815597r:19875704_20078773|GENSCAN_predicted_peptide_3|86_aa
MLGSSHPVSEKTCGEKGSESLEATTIIIIIITISIIIVNITIITITIIVITINITIIITI
AITITIFITINIIYMVFICVPTQISH
>gi568815597r:19875704_20078773|GENSCAN_predicted_CDS_3|261_bp
atgctgggctcttctcaccctgtttcagagaagacttgcggggagaaggggtcagagagc
ttagaagccaccaccatcatcatcatcataatcactatcagtatcatcatcgtcaatatt
accatcatcaccatcaccatcattgtcatcaccatcaatattaccatcatcatcaccatt
gccatcaccatcaccatcttcatcaccatcaatatcatttatatggtttttatctgtgtc
cctacccaaatctcacattga
>gi568815597r:19875704_20078773|GENSCAN_predicted_peptide_4|444_aa
MPEPPLSAAVGSCAARASPTSAIPCSMAPSPIDHPRAEECGTQRSTCRQLHLRQVGSFTP
EPVRPRTHQKEETPNTSEHQKEQTPDTPPLRTVTLTVRDRGFILEVSETKNPPIPVTVCV
TTYHMMSFTSLLQAHGNLVNFHRMIKLTTGKEAALSYGFYGCHCGVGGRGSPKDATDRCC
VTHDCCYKRLEKRGCGTKFLSYKFSNSGSRITCGNSSSYWPAGRKQEGFRWEIPTWTSYS
PTCFSNNTHLGCWVWIPSQRHPAQGTQMVKEVDLRLSFHLLGCSTRLNPSSLAILVISVI
GFLFGEQQDLHQTPGVSVTEFLTVWQWFSNLSKHRNYTEPCKKYFQGPIPGVSNSAPCIW
TGIVTCFDQMDVAKVLLQDNDDDGGGDDDDDGGGVSGNDCDGDDSDSNVRQGPNEPYPDF
IACLKDAAQKAILDAHVRETIVQL
>gi568815597r:19875704_20078773|GENSCAN_predicted_CDS_4|1335_bp
atgcctgagcctcccctctccgccgccgtgggctcctgcgcggcccgagcctcccccacg
agcgccatcccctgctccatggcgcccagtcccattgaccacccaagggctgaggagtgt
ggcacacagcgcagcacttgcaggcagctccacctgcggcaggtcggcagcttcactcct
gagccagtgagaccacgaacccaccagaaggaagaaactccaaacacatctgaacatcag
aaggaacaaactccggacacgccgcctttaagaactgtaacactcactgtgagggaccgc
ggcttcattcttgaagtcagtgagaccaagaacccaccaattccggtcacagtatgtgtc
accacctatcacatgatgtcatttactagcctactgcaggcccatgggaatttggtgaat
ttccacagaatgatcaagttgacgacaggaaaggaagccgcactcagttatggcttctac
ggctgccactgtggcgtgggtggcagaggatcccccaaggatgcaacggatcgctgctgt
gtcactcatgactgttgctacaaacgtctggagaaacgtggatgtggcaccaaatttctg
agctacaagtttagcaactcggggagcagaatcacctgtggcaatagcagctcctactgg
cctgctgggagaaagcaggaagggttccgctgggaaattcccacctggaccagctacagc
cccacctgtttctccaataacacgcatctgggctgctgggtgtggatcccttcacaaagg
caccctgcacaggggacacagatggtcaaggaggtggatttgagactgagttttcatctc
cttggctgcagcacccgattaaatccttcttccttggcaatacttgtcatctcagtgatt
ggctttctgtttggcgagcagcaggacctacatcaaacccctggtgtttcagtaacagaa
tttctaacagtgtggcagtggttctccaacttgagcaagcatcggaattacacggagcct
tgtaaaaaatatttccagggcccaatccctggagtttctaattcagctccttgcatctgg
actggcattgtaacttgctttgatcaaatggatgtggcaaaagtgctgctgcaggataat
gatgatgatggtggtggtgatgatgatgatgatggtggtggtgttagtggtaatgattgt
gatggtgatgacagtgatagtaatgtcagacagggccctaatgaaccttacccagacttc
attgcctgcctaaaagatgcagctcaaaaggctatcttggacgcacatgtccgagagaca
attgtccaactatag
>gi568815597r:19875704_20078773|GENSCAN_predicted_peptide_5|171_aa
MRQKIKKDIQDLNAALDQEDLIDIYRTLHHKSRDYTFFSVPHSTYSKIDHIIGNKTLLSR
CKRIEIITVSLSDHSAIKLELRIKKLSQNCTTTWKLNNLLLNDYWVTNKIKAEITKFFET
SEKKETTYQNLWDTFKAVCKGKFVTLNAHIRKQETSKIGTLILQLKELEKQ
>gi568815597r:19875704_20078773|GENSCAN_predicted_CDS_5|516_bp
atgagacagaaaattaaaaaggacattcaggacttgaacgcagctctggaccaagaggac
ctaatagatatctacagaaccctccaccacaaatcaagagactatacgttcttctcagta
ccacatagcacttattctaaaatcgaccacataattggaaataaaacactcctcagcaga
tgcaaaagaatagaaatcataacagtcagcctctcagaccatagtgcaataaaattagaa
ctaaggattaagaaactcagtcaaaactgcacaactacatggaaactgaacaaccttctc
ctgaatgactactgggtaaccaacaaaattaaagcagaaataacgaagttctttgaaacc
agtgagaaaaaagagacaacataccagaatctctgggacacatttaaagcagtgtgtaaa
ggcaaatttgtaacactaaatgcccacatcagaaagcaggaaacatctaaaattggcacc
cttatattacaattaaaagaactagagaagcaatag
>gi568815597r:19875704_20078773|GENSCAN_predicted_peptide_6|127_aa
MSELLFTIATKRIKYPGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWIGIMNIM
KMAIMPKFCGLWGDPTVMVPPDILMESLCGSLDAMAPLGIALVEAVCSGPTPVAVLFLGP
KALWDIL
>gi568815597r:19875704_20078773|GENSCAN_predicted_CDS_6|384_bp
atgagtgaactcctattcacaattgctacaaagagaataaaatacccaggaatccaactt
acaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaacgaaataaaa
gaggacacaaataaatggaagaacattccatgctcatggataggaataatgaatatcatg
aaaatggccataatgcccaagttttgtggtctttggggtgaccccactgtcatggttcca
ccagacatccttatggagtctctctgtggcagtcttgatgccatggccccactgggcatt
gccttagtggaggctgtctgcagcggccccacccctgtggcagttctcttcctgggccct
aaggctctctgggacatactttga
>gi568815597r:19875704_20078773|GENSCAN_predicted_peptide_7|78_aa
MNNIDRPLARFTKKRREKIQMQKTRLTKSRIIKIKTFRKISIEGTYLKIIKAIYDKPTVN
IILNKEKLKAFPLRTGTR
>gi568815597r:19875704_20078773|GENSCAN_predicted_CDS_7|237_bp
atgaacaatattgatagaccattagcaagattcaccaagaaaagaagagagaagatccaa
atgcagaaaactcgtttgacaaaatcaagaatcataaagattaaaaccttcagaaaaatc
agcatagaagggacatacctcaagataataaaagccatctatgacaagcccacagtcaac
attatactgaacaaggaaaagttgaaagcatttcccctgagaactggaacaagataa
>gi568815597r:19875704_20078773|GENSCAN_predicted_peptide_8|52_aa
MTVSAMKGKNRVTLGNARVDSISDEFYNIPETGKLPVERRIMEDFPEEEDLM
>gi568815597r:19875704_20078773|GENSCAN_predicted_CDS_8|159_bp
atgacggtcagtgccatgaaggggaaaaacagggtgaccttgggaaatgccagagtggac
agcatatcagatgagttctataacattccagagacaggtaagctccctgtggaaagaagg
atcatggaagacttcccggaggaggaggacttgatgtag
>gi568815597r:19875704_20078773|GENSCAN_predicted_peptide_9|61_aa
XHSLDELIRVQGFSSYFYAGIRGCPGQRSLALALTQQTMLNEGMTEYSNQEKAQCQIQDP
T
>gi568815597r:19875704_20078773|GENSCAN_predicted_CDS_9|186_bp
nnccactctctggatgagctcattcgagttcagggatttagctcttatttctatgcaggc
atccgggggtgtcccggacagagaagtttggccctagctttgactcaacaaacaatgctt
aatgaaggaatgactgaatacagcaaccaggagaaagcacaatgtcagatccaagatcca
acgtag