GENSCAN 1.0    Date run: 6-Nov-116    Time: 03:26:09

Sequence gi568815597r:19714421_19915101 : 200681 bp : 45.67% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2313   2465  153  2  0   87   80   143 0.832  13.67
 1.02 Term +   4394   4582  189  1  0   47   33    97 0.425  -2.45
 1.03 PlyA +   6219   6224    6                               1.05

 2.20 PlyA -   8035   8030    6                               1.05
 2.19 Term -  16760  16660  101  0  2   79   38    52 0.265  -2.41
 2.18 Intr -  23036  22952   85  2  1  111  105    39 0.972   7.29
 2.17 Intr -  25540  25404  137  2  2   96   82    91 0.817   9.59
 2.16 Intr -  26521  26357  165  2  0   67  119   197 0.968  20.63
 2.15 Intr -  31231  31112  120  2  0   50   56   171 0.758  10.57
 2.14 Intr -  32179  32036  144  2  0  134  110   131 0.999  20.15
 2.13 Intr -  32840  32743   98  1  2  100  109    94 0.400  12.35
 2.12 Intr -  38150  38007  144  1  0   70   55    60 0.394   0.30
 2.11 Intr -  41346  41214  133  1  1   43   43   177 0.650   8.40
 2.10 Intr -  56149  56122   28  1  1  128  115    25 0.573   6.89
 2.09 Intr -  57062  56888  175  1  1   34   86   126 0.182   6.94
 2.08 Intr -  58037  57753  285  1  0  110   43   108 0.014   4.96
 2.07 Intr -  66346  66160  187  2  1   66  109   124 0.494  11.05
 2.06 Intr -  71083  70986   98  0  2   51   97    85 0.967   5.35
 2.05 Intr -  72697  72606   92  1  2  111  106     7 0.778   3.59
 2.04 Intr -  77290  77111  180  1  0   46   66    98 0.159   3.46
 2.03 Intr -  82014  81898  117  0  0   67   92    23 0.592   1.26
 2.02 Intr -  82960  82883   78  1  0  122   84    10 0.675   3.75
 2.01 Init -  85757  85692   66  2  0   36   59   124 0.736   3.38
 2.00 Prom -  90155  90116   40                              -2.36

 3.03 PlyA -  91580  91575    6                               1.05
 3.02 Term - 100801  99998  804  1  0   -5   48   888 0.260  68.73
 3.01 Init - 111591 111553   39  0  0   64   92    59 0.446   4.09
 3.00 Prom - 113014 112975   40                              -3.46

 4.00 Prom + 126113 126152   40                              -2.66
 4.01 Init + 130738 130798   61  0  1   93   76    11 0.415   1.81
 4.02 Intr + 146072 146146   75  0  0   83   89    44 0.204   3.49
 4.03 Term + 157985 158121  137  0  2   77   42   153 0.445   7.78
 4.04 PlyA + 160429 160434    6                               1.05

 5.00 Prom + 162162 162201   40                              -5.86
 5.01 Init + 168094 168314  221  0  2   65   76   454 0.754  40.00
 5.02 Intr + 175965 176113  149  0  2   68   78    96 0.691   6.48
 5.03 Intr + 179948 180060  113  0  2   71   96    66 0.993   5.80
 5.04 Intr + 183120 183242  123  2  0  108   76   185 0.983  20.08
 5.05 Intr + 189853 189978  126  0  0   73   13   122 0.596   4.08
 5.06 Intr + 190471 190567   97  0  1   70   92    81 0.987   6.28
 5.07 Intr + 192012 192196  185  1  2   36   95    83 0.904   3.31
 5.08 Term + 193150 193326  177  0  0  108   41   230 0.995  17.99
 5.09 PlyA + 195992 195997    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:19714421_19915101|GENSCAN_predicted_peptide_1|113_aa
AGRADGGDTTCKGPDLKMVLHPSLGAVQQVSMEALSLGIHAGAASLKEAERFGEEQVFST
KFLPRNAAAPTKNHASHCSLSLRSRYNNNPMKQLLTTHNVKPLKRKSNLRLAT

>gi568815597r:19714421_19915101|GENSCAN_predicted_CDS_1|342_bp
gctggccgtgcagatggaggagacaccacttgcaaagggcccgacctgaaaatggtactt
catccttctcttggtgcagttcaacaggtatccatggaggccctgtccctgggcatccat
gctggagccgccagcctcaaggaagctgagcggtttggagaggagcaggtattctccacc
aaatttctgccgaggaatgcggcggcaccaactaaaaatcatgcttcccattgctccctg
agcctgagatccaggtacaacaacaatcccatgaagcaactgctaactacacacaacgta
aaaccactcaaacgaaaatccaacttgagactcgccacctag

>gi568815597r:19714421_19915101|GENSCAN_predicted_peptide_2|810_aa
MWRRGLAGHLQRPAPLSRWVVEVTSSHSSHFSPSQTCTPNAQATRLLLGDRHISNRPLCP
RRGGGAQTKSLELQSWQAPHSLDKSREGTEVLLVLTEGAFSRRRNPALSWNQCNFSDTLE
IQFYFGLDPDPKSDQGVEAPKNQYTQQGNLHISPKNHPEPEQPVFRQREAHGCFLKSGKY
FLNAYSDPGPVQKARGVAVKKQPGNCLVHTALGMAMWNRPCQRLPQQPLVAEPTAEGEPH
LPTGRELTEANRFAYAALCGISLSQLFPEPEHRAAPSHPPLGTPVGLHSDAHYGQRLAVA
VHVEGTLPFTSSTFAAASVFQKARPWQHPGEQMAFSGKPFLVAAVTSAGGSGAPSVGCTE
LISTSVLGSFCTEFMAGLVQWLELSEAVLPTMTAFASGLGGEGADVFVQILLKDPILKDD
PTVITQDLLSFSLKDGHYDARARVLVCHMTSLLQVPLEELDVLEEMFLESLKEIKEEESD
TKKSDLKTEGGGEDKDRKHPLMGDKMTESKTDLWIYEMGNAGVLLFRRMAEASRKKKENR
RKWKRYLLIGLATVGGGTVIGVTGGLAAPLVAAGAATIIGSAGAAALGSAAGIAIMTSLF
GAAGAGLTGYKMKKRVGAIEEFTFLPLTEGRQLHITIAVTGWLASGKYRTFSAPWAALAH
SREQYCLAWEAKYLMELGNALETILSGLANMVAQEALKYTVLSGIVAALTWPASLLSVAN
VIDNPWGVCLHRSAEVGKHLAHILLSRQQGRRPVTLIGFSLGARVIYFCLQEMAQEKAPP
TLFLNHSPPKSLPLYYQWGIFTPPDGRDRQ

>gi568815597r:19714421_19915101|GENSCAN_predicted_CDS_2|2433_bp
atgtggcgccgcgggcttgccggtcacctgcagaggccggcacccctgtcgcgatgggtg
gtggaggtaacatcttctcattcttcacattttagccctagccagacttgtactccaaat
gcccaggccacaagacttctactgggagacagacacattagcaaccggccattatgtcca
aggagaggtggaggggcccagacgaagtctttggagcttcagagttggcaggccccacat
tcgctggacaaatccagggagggcacagaggtactgctggtcctgacagaaggggctttt
tctcgtcgcaggaatccagcactctcttggaatcagtgtaacttctcagacacacttgag
atccagttttattttggactggatcctgatcccaaatcagatcaaggggtggaggccccc
aagaaccagtatacacagcagggcaatttgcatatttctccaaagaaccatccagaacct
gagcagcctgtcttcagacagagagaggcccacggctgtttcttgaaatctggcaagtat
ttcctgaatgcctacagtgatccaggtcctgtccagaaagctcggggtgtggcagtgaaa
aaacaaccagggaactgcctcgtacacacagcgctgggaatggccatgtggaacaggcca
tgccagaggctgcctcagcagcctctggtagctgagcccactgcagagggggagccacac
ctgcccacgggccgggagctgactgaggccaaccgcttcgcctatgctgccctctgtggc
atctccctgtcccagttatttcctgaacccgaacacagagcggccccatcccatcctccc
ctggggactcctgtaggactgcactcagatgcccactatggacagagactggcggtggca
gtgcacgtggagggcactcttcccttcacttcttccacttttgcagcagccagcgttttc
cagaaagcacgtccttggcagcatccaggagagcagatggccttctctgggaagcctttt
ctggtggctgccgtcacttctgcaggtggatccggggctccctctgtgggctgcactgaa
ctgatctccacctctgtcctaggctccttctgcacagagttcatggcaggcctggtgcag
tggctggagttgtctgaagctgtcttgccaaccatgactgcttttgcgagcggcctggga
ggtgaaggagcagatgtgtttgttcaaattttactgaaggaccccatcttgaaggacgac
ccgacggtgatcactcaggaccttctgagcttctcactcaaggatgggcactatgacgcc
cgggccagagtcctcgtttgccacatgacctccctgctccaagtgcccttggaggagctg
gatgtccttgaagagatgttcctggagagcctgaaggaaatcaaagaagaggaatctgac
acaaaaaaaagtgatttaaaaacagaaggtggtggtgaggataaggacagaaaacaccct
cttatgggagataagatgacagagagtaagaccgatttatggatctatgagatgggcaat
gcgggcgtcttgctcttcaggagaatggccgaggcatcccgaaagaagaaagaaaaccgg
aggaaatggaagcgttatctcctgataggcctggcgactgtcggaggcggaacggtgatc
ggtgtgactggaggtctagctgcaccccttgttgccgctggagcagcgacgattattggc
agcgccggggcagcggctctgggctcagcagccggcatagccatcatgacctcgctgttt
ggtgcagctggagctggcctgacaggatacaagatgaagaagcgagtgggagccattgaa
gagttcacgtttctgcctctgacggagggcaggcagctgcacatcaccatcgccgtcacg
gggtggctcgcttctggcaaataccgcaccttcagtgccccgtgggctgccctggcccac
agccgtgagcagtactgcctggcctgggaagccaagtacctgatggagctcggcaatgcc
ctggagaccatcctcagtggtctcgccaacatggtggcccaggaggccctaaagtacaca
gtgttgtctggcattgtggctgccctgacctggccagcctcactcctcagtgtcgccaat
gtcatcgacaacccctggggggtgtgtctccatcgatcagcagaggttggcaagcacctg
gcccacatcctgctctcccggcagcaggggcgacgacctgtcaccttgattggcttcagc
ctgggagccagagtcatctacttctgtctgcaggagatggctcaagagaaagcgcctcct
accctcttcctcaaccattcccctcccaagtccttgcccttgtattaccaatggggcatc
ttcacacctccagatggtcgagatcgtcaatga

>gi568815597r:19714421_19915101|GENSCAN_predicted_peptide_3|280_aa
MSEKSAKDQDKVQAPGQSEKEGQIPSKSQRRQLTLSANAQVLSDSVPPVPVPRMACTKTL
QQSQPISAGATTTTTAVAPAGGHSGSTECDLECLVCREPYSCPRLPKLLACQHAFCAICL
KLLLCVQDNTWSITCPLCRKVTAVPGGLICSLRDHEAVVGQLAQPCTEVSLCPQGLVDPA
DLAAGHPSLVGEDGQDEVSANHVAARRLAAHLLLLALLIILIGPFIYPGVLRWVLTFIIA
LALLMSTLFCCLPSTRGSCWPSSRTLFCREQKHSHISSIA

>gi568815597r:19714421_19915101|GENSCAN_predicted_CDS_3|843_bp
atgagcgagaagtcagcaaaagaccaggacaaagtccaggcacctggccagagtgaaaag
gaggggcagattccatccaagtcccagcggcgccagctgacactctctgccaatgcccag
gtgctgagcgacagtgtcccaccggtccctgtgcccagaatggcctgcaccaagaccctg
caacagtcccagcccatctccgcaggagccaccacaaccaccaccgctgtggcccctgct
gggggtcattctggctccacagaatgtgacctggagtgtctggtgtgccgggagccctac
agctgtccccggttgcccaagctgctggcctgccagcatgccttctgcgccatctgcctg
aagctcctgctgtgcgtgcaggacaacacctggtccatcacctgcccgctgtgccgcaag
gtcaccgccgtccccgggggcctcatctgcagcctgcgcgaccatgaggcggtggtgggg
cagctggcccagccatgcacagaggtatcgctctgtcctcaggggctggtggatcctgct
gacttggcagcaggacaccccagcttggtgggagaggatggacaggatgaagtaagtgca
aaccacgtggcagcccggcgcctggccgcgcacctactcctgctggccttgctcattatc
ctcatcgggcccttcatctacccgggtgtcttacgatgggtgctcaccttcatcatcgcc
ctggccctgctgatgtccaccctcttctgctgtctccccagcacccggggcagctgctgg
ccctcctccaggactctcttctgcagagagcagaaacacagccacatctcttccattgcc
tga

>gi568815597r:19714421_19915101|GENSCAN_predicted_peptide_4|90_aa
MTERGGKKDTMFSKCDTEFKEKRREQQWEELAPFMAIWVVEQVSPGWGPGTGADDAVVNN
EFTQDDDDCKYCGPLGSLAQTPNKDIAMEL

>gi568815597r:19714421_19915101|GENSCAN_predicted_CDS_4|273_bp
atgactgaaaggggaggaaagaaagacacaatgttttccaaatgtgatacagaattcaag
gaaaagagaagagagcaacagtgggaagaacttgctccattcatggccatctgggtggtg
gagcaggtgtctccagggtggggtcctggcacaggtgctgatgatgcagttgtaaataat
gagtttacacaggatgacgatgactgcaaatactgtgggccattaggatctctggcacag
acaccaaacaaagacattgctatggagctgtaa

>gi568815597r:19714421_19915101|GENSCAN_predicted_peptide_5|396_aa
MSRKQAAKSRPGSGSRKAEAERKRDERAARRALAKERRNRPESGGGGGCEEEFVSFANQL
QALGLKLREVPGDGNCLFRALGDQLEGHSRNHLKHRQETVDYMIKQREDFEPFVEDDIPF
EKHVASLAKPGTFAGNDAIVAFARNHQLNVVIHQLNAPLWQIRGTEKSSVRELHIAYRYG
EHYDSVRRINDNSEAPAHLQTDMLHQDESNKREKIKTKGMDSEDDLRDEVEDAVQKVCNA
TGCSDFNLIVQNLEAENYNIESAIIAVLRMNQGKRNNAEENLEPSGRVLKQCGPLWEEGG
SGARIFGNQGLNEGRTENNKAQASPSEENKANKNQLAKVTNKQRREQQWMEKKKRQEERH
RHKALESRGSHRDNNRSEAEANTQVTLVKTFAALNI

>gi568815597r:19714421_19915101|GENSCAN_predicted_CDS_5|1191_bp
atgtcccgaaagcaggcggcgaagagccggccgggcagcggcagccggaaagccgaggcc
gagcgcaagcgggacgagcgggcggcgcgccgggccctggccaaggagcggcggaatcgg
ccggagtctggcggcggcggcggctgcgaggaggagttcgtcagcttcgccaaccagctg
caggccctggggctgaagctgcgggaggtgccgggggacggcaattgcttgttcagagct
cttggtgatcaattggagggacactcacgaaatcatctcaagcacagacaggagacagtg
gactacatgataaagcagcgggaagattttgaaccctttgtagaagatgacattcctttt
gagaagcatgtggccagtttggcaaagcctggtacttttgctggcaatgatgcaattgta
gcctttgcaagaaatcatcagttgaatgtagtgattcatcaacttaatgcccctttgtgg
cagattcgtggtacagagaaaagcagcgtgagggagttacacatcgcatatcggtatgga
gagcactacgacagtgttcggaggatcaatgacaactcagaggcacctgcacatctccag
acggatatgcttcatcaagatgaatcaaataaaagagaaaagatcaagacaaagggaatg
gactctgaagacgacctgagagatgaagtagaggatgctgtccagaaagtttgtaatgca
actggatgttcagattttaatttaatagtccagaacctggaagctgaaaattataatatt
gaatctgcaataattgccgtgcttcggatgaaccaagggaagagaaataatgcagaagag
aatcttgagcccagtggtcgagtgctgaagcagtgtggccctttgtgggaggagggtggc
agtggtgccagaatctttggaaatcagggcttaaatgaaggcaggaccgaaaacaataag
gcacaggccagccctagtgaagaaaacaaagcaaataaaaaccagctcgcaaaggtcaca
aacaaacagaggcgagaacagcagtggatggagaagaagaagcggcaggaggagaggcac
cgccacaaagccctggagagcagaggtagccacagggacaataacagaagcgaagcagag
gcgaacacgcaggtcaccttggtgaagaccttcgccgctctcaacatctga