GENSCAN 1.0 Date run: 5-Nov-116 Time: 02:02:25 Sequence gi568815576r:22414005_22615630 : 201626 bp : 43.85% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 13541 13586 46 1 1 109 91 24 0.885 3.65 1.02 Intr + 13709 14005 297 0 0 108 3 158 0.477 6.15 1.03 Intr + 18155 18522 368 0 2 112 84 199 0.720 16.77 1.04 Term + 21976 22110 135 0 0 90 50 70 0.549 1.42 1.05 PlyA + 22895 22900 6 1.05 2.05 PlyA - 24692 24687 6 1.05 2.04 Term - 27180 26890 291 0 0 33 45 131 0.041 -1.36 2.03 Intr - 34290 34225 66 0 0 108 92 29 0.151 4.40 2.02 Intr - 34739 34627 113 0 2 58 95 22 0.006 -0.00 2.01 Init - 41580 41556 25 0 1 81 89 41 0.181 3.23 2.00 Prom - 44502 44463 40 -2.26 3.03 PlyA - 46467 46462 6 1.05 3.02 Term - 48599 48327 273 2 0 88 43 131 0.227 4.07 3.01 Init - 75394 73832 1563 1 0 49 48 422 0.072 26.78 3.00 Prom - 75460 75421 40 -3.86 4.00 Prom + 86365 86404 40 -4.06 4.01 Init + 88752 88816 65 2 2 93 29 38 0.116 -0.98 4.02 Intr + 93568 93691 124 1 1 99 -38 94 0.403 -1.51 4.03 Term + 94515 94871 357 2 0 99 48 235 0.605 15.11 4.04 PlyA + 95417 95422 6 1.05 5.02 PlyA - 95430 95425 6 1.05 5.01 Sngl - 101626 99998 1629 1 0 51 48 356 0.668 23.22 5.00 Prom - 113346 113307 40 -2.66 6.07 PlyA - 113685 113680 6 1.05 6.06 Term - 134639 134063 577 1 1 96 37 291 0.896 18.74 6.05 Intr - 136330 135722 609 0 0 131 94 515 0.985 47.93 6.04 Intr - 137085 136763 323 0 2 95 110 321 0.976 29.66 6.03 Intr - 143576 143541 36 2 0 113 111 5 0.541 3.76 6.02 Intr - 145400 145264 137 0 2 98 54 9 0.021 -1.21 6.01 Init - 156757 156625 133 1 1 78 47 103 0.849 5.50 6.00 Prom - 167995 167956 40 -6.36 7.00 Prom + 168958 168997 40 -7.26 7.01 Init + 174209 174254 46 1 1 109 61 101 0.970 8.35 7.02 Intr + 174377 174636 260 0 2 98 4 102 0.804 -0.02 7.03 Intr + 180781 181077 297 0 0 70 50 227 0.296 14.17 7.04 Term + 186983 187078 96 1 0 100 37 85 0.416 2.57 7.05 PlyA + 191912 191917 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl - 75394 73763 1632 1 0 49 31 403 0.910 26.02 S.002 Init + 165838 165883 46 0 1 109 66 93 0.951 8.05 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576r:22414005_22615630|GENSCAN_predicted_peptide_1|281_aa MAWTPLLLLLLSHCTGSLSQPVLTQPPSSSASPGESARLTCTLPSDINVGSYNIYWYQQK PGSPPRYLLYYYSDSDKGQGSGVPSRFSGSKDASANTGILLISGLQSEDEADYYWSWAQS VLTQPPSVSEAPRQRVTISCSGSSSNIGNNAVNWYQQLPGKAPKLLIYYDDLLPSGVSDR FSGSKSGTSASLAISGLQSEDEADYYCAAWDDSLNGPTVLQAQGEQRQEPPSFSARRYNV DYGFERVGFSYHKVCLFYVNFAKAFNHEVMLDFAKCFFCMH >gi568815576r:22414005_22615630|GENSCAN_predicted_CDS_1|846_bp atggcctggactcctcttcttctcttgctcctctctcactgcacaggttccctctcccag cctgtgctgactcagccaccttcctcctccgcatctcctggagaatccgccagactcacc tgcaccttgcccagtgacatcaatgttggtagctacaacatatactggtaccagcagaag ccagggagccctcccaggtatctcctgtactactactcagactcagataagggccagggc tctggagtccccagccgcttctctggatccaaagatgcttcagccaatacagggatttta ctcatctccgggctccagtctgaggatgaggctgactattactggtcctgggcccagtct gtgctgactcagccaccctcggtgtctgaagcccccaggcagagggtcaccatctcctgt tctggaagcagctccaacatcggaaataatgctgtaaactggtaccagcagctcccagga aaggctcccaaactcctcatctattatgatgatctgctgccctcaggggtctctgaccga ttctctggctccaagtctggcacctcagcctccctggccatcagtgggctccagtctgag gatgaggctgattattactgtgcagcatgggatgacagcctgaatggtcccacagtgctc caggcccagggggaacagagacaagaacccccttccttttctgccaggaggtataatgtt gactatgggtttgaaagagtgggcttttcttaccataaggtatgtctcttctatgtcaat tttgcaaaggcttttaaccatgaagtgatgctggattttgccaagtgctttttctgcatg cattga >gi568815576r:22414005_22615630|GENSCAN_predicted_peptide_2|164_aa MDTLFIRGGGMESVSTEENVFKYVSYLDKIIHSLENTVRLQVPAGKKIQDISQSVLYHSI PAHVTHRATCSTIYLVLERNKYLVFTRVQGPLNQTYSRGHLCEGVGELKRPSWVQLWTLL QVIVAVDVDFFVDVDWDGTIPATLDSDMGKGSRENRGMDQNQKQ >gi568815576r:22414005_22615630|GENSCAN_predicted_CDS_2|495_bp atggatacgctcttcatccgtggtggtggcatggagtcagtgtccacagaagaaaatgtc tttaaatatgttagttacctggacaagattattcactccttagaaaatacagtaagatta caagtcccagctggcaagaaaatacaagacatcagccagagtgtgctctaccacagcatc ccagcccatgtcactcacagggcaacctgctccactatatacctggtgttagagaggaat aaataccttgtgttcaccagggtccaggggccactcaaccagacctacagtagaggtcac ctgtgtgaaggggtgggggagctgaagaggccctcttgggtacaactgtggactctgctt caagtcatagtagctgtagatgtagacttctttgtagatgttgactgggatggaaccatc cctgccactctggactcagacatgggcaaggggtccagggaaaacagaggcatggaccag aatcagaagcaataa >gi568815576r:22414005_22615630|GENSCAN_predicted_peptide_3|611_aa MEQSCEEEKEPEPQKNIQETKQVDDEDAELIFVGVEHVNEDAELIFVGVTSNSKPVVSNI LNRVTPGSWSRRKKYDHLRKDTARKLQPKSHETVTSEAVTVLPASQLESRSTDSPIIIEP LSKPDYRNSSPQVVPNNSSELPSPLITFTDSLHHPVSTALSVGGINESPRVSKQLSTFEV NSINPKRAKLRDGIIEGNSSASFPSDTFHTMNTQQSTPSNNVHTSLSHVQNGAPFPAAFP KDNIHFKPINTNLDRANELAKTDILSLTSQNKTFDPKKENPIVLLSDFYYGQHKGEGQPE QKTHTTFKCLSCVKVLKNVKFMNHVKHHLEFEKQRNDSWENHTTCQHCHRQFPTPFQLQC HIENVHTAQEPSTVCKICELSFETDQVLLQHMKDHHKPGEMPYVCQVCHYRSSVFADVET HFRTCHENTKNLLCPFCLKIFKTATPYMCHYRGHWGKSAHQCSKCRLQFLTFKEKMEHKT QCHQMFKKPKQLEGLPPETKVTIQVSLEPLQPGSVDVASITKLTQLAEAQPQEIEPPLFN PGNLVLVKTLTSLSFPKPSCEGPYTVLFQPPWQQKLQVSTSEYITLKSKPECLREQPLTA QRNVLNIKVEK >gi568815576r:22414005_22615630|GENSCAN_predicted_CDS_3|1836_bp atggaacaatcatgtgaggaagagaaagagcctgaaccacagaagaacatacaagaaacc aaacaagtagatgacgaagatgctgagctcatctttgttggtgtggaacatgtaaatgaa gatgctgagctaatctttgttggggtgacttcaaattcaaaaccagtcgtttcaaacatt ttgaacagagtcaccccgggttcatggtcaaggagaaaaaagtatgatcaccttagaaaa gatactgctcgcaaattgcagcctaaaagtcatgagaccgttacatcagaagcagtgacc gtcctgccagcttcccaacttgaatcgagatcaacagatagtcctattattattgagcct ttgtctaaacctgattatagaaatagttcaccacaagttgtgcctaataactcttcagaa ttaccttctcctttgattacattcacagattcattgcatcatccagtaagtacagcactt tcagtaggaggtataaatgaaagtcctcgtgtatcaaagcaactttccactttcgaagta aacagcataaatcccaaaagggctaaactcagggatggaattatagaaggaaattcttca gcttcattcccttcagatacctttcatacaatgaatactcagcaaagtacaccctcaaat aatgttcatacctcattaagccatgttcagaatggagcaccttttccagcagcttttcca aaggacaatatccatttcaagcctataaatacaaatcttgatagggcaaatgaattggca aaaacagacattttgagtctaacaagtcaaaacaagacctttgatcccaagaaagaaaat cccattgtgttacttagtgacttttactatggacagcataaaggagaagggcagccggaa cagaagactcacaccacctttaaatgcctcagctgcgtgaaagttctaaaaaatgttaag tttatgaatcacgtgaagcatcatttggaatttgagaagcagaggaacgacagctgggaa aaccacaccacctgccagcactgccaccggcagtttcccactcccttccagctacagtgt cacatcgaaaatgtccacactgcccaggagccctctactgtctgtaaaatctgtgaattg tcatttgaaacagatcaggtcctcttacaacacatgaaggaccatcataagcctggcgaa atgccctatgtgtgccaggtttgccattatagatcgtcggtctttgctgatgtagaaaca cattttagaacgtgccatgaaaacacaaagaatttgctttgtcccttttgtctcaaaatt ttcaaaacagcaacaccatacatgtgtcattataggggccactggggaaagagtgcacac cagtgttccaagtgccggctacagtttttaactttcaaggagaaaatggagcacaagacc cagtgtcatcaaatgtttaagaagcctaagcaactagaaggattacctcctgaaacaaaa gttactattcaagtgtcgctggaacctcttcagccaggatcagtggatgtagcatccata actaaattgacacaactggcagaagcccagccccaggaaatagaaccacctttatttaac ccaggaaatttggtactagtgaaaactctcacctctctctccttccctaagccaagctgc gaagggccctacactgttctctttcaacccccttggcagcaaaagttacaggtgtcaact tctgaatacatcacactcaagtcaaagcctgaatgcctgagggagcaacccctgacagcc cagaggaacgtcctgaatatcaaagtggagaaatag >gi568815576r:22414005_22615630|GENSCAN_predicted_peptide_4|181_aa MGSMKKNERNIAGLKNGPQNIRDVVKECHYHPLLSQQSNLEGSRMTYINNWPELIKHIIQ IPSKAAVLQRPLRQRTAKRGVATRRAAARHEIPQVSPQGTLKPRHAPVLTSPDQLLFASC RPRTSPGGAPGAQPAATTPPQAPGAADAHASTRARPVETCDWLPGAGARSARLRARSMRA L >gi568815576r:22414005_22615630|GENSCAN_predicted_CDS_4|546_bp atgggctcgatgaagaagaatgaaaggaatatagcaggtttgaaaaatggcccccaaaat atcagagatgtggtaaaagaatgccactaccaccctctcctttctcaacaaagcaattta gagggttcccgaatgacctacatcaacaattggcctgaactaatcaaacacatcattcag attccctcaaaagcggctgtattgcaaaggccgctccggcagcgcacggccaagcgcggt gtcgcgacccgcagggctgcggctcgccatgaaatccctcaagtctcccctcagggaacg ctgaagccgcgccacgcccccgtccttaccagtccggatcagctgctgttcgcgagctgc cggccacgcaccagccccggaggcgctcccggggcacagccggcggcgactacgcctcct caggcccccggcgccgccgacgcgcacgcctccacacgcgcgcgtccagtggagacctgc gattggctgccaggtgccggcgcgagatcggcgcggctccgagctaggagcatgcgcgcg ctctga >gi568815576r:22414005_22615630|GENSCAN_predicted_peptide_5|542_aa MGDIFLCKKVESPKKNLRESKQREEDDEDPDLIYVGVEHVHRDAEVLFVGMISNSKPVVS NILNRVTPGSNSRRKKGHFRQYPAHVSQPANHVTSMAKAIMPVSLSEGRSTDSPVTMKSS SEPGYKMSSPQVVSPSSSDSLPPGTQCLVGAMVSGGGRNESSPDSKRLSTSDINSRDSKR VKLRDGIPGVPSLAVVPSDMSSTISTNTPSQGICNSSNHVQNGVTFPWPDANGKAHFNLT DPERASESALAMTDISSLASQNKTFDPKKENPIVLLSDFYYGQHKGDGQPEQKTHTTFKC LSCVKVLKNIKFMNHMKHHLEFEKQRNDSWEDHTTCQHCHRQFPTPFQLQCHIDSVHIAM GPSAVCKICELSFETDQVLLQHMKDHHKPGEMPYVCQVCHYRSSVFADVETHFRTCHENT KNLLCLFCLKLFKTAIPYMNHCWRHSRRRVLQCSKCRLQFLTLKEEIEHKTKDHQTFKKP EQLQGLPSETKVIIQTSVQPGSSGMASVIVSNTDPQSSPVKTKKKTAMNTRDSRLPCSKD SS >gi568815576r:22414005_22615630|GENSCAN_predicted_CDS_5|1629_bp atgggagatatctttttgtgtaagaaagtggaatcaccaaagaagaatttgagagaatcc aaacaaagggaggaggatgatgaagatccagatctgatctatgttggggtggagcatgta catagagatgctgaagttctctttgtcgggatgatttcaaattcaaaaccagtcgtttca aacattttgaacagagtcaccccaggctcaaattcaagaagaaagaaaggccacttccgt caatatcctgctcacgtgtcgcagcctgcaaatcatgtgacctctatggcaaaagccatc atgccggtttctctgtctgaggggcgatcgacagatagtcctgtcactatgaagtcttca tctgaacctggttataaaatgagctcaccacaagttgtttctcccagttcctcagactcg ctccccccagggactcagtgtctagttggagctatggtctctggaggaggcagaaatgag agttctcctgattcaaagcgactttccacttcagatataaacagcagagattccaaaagg gttaaactcagggatggaatcccaggggtaccttctttagctgtggtcccttcagatatg tcttctacaataagcacaaatacaccctcacaggggatctgcaactcatcaaaccatgtt cagaatggagtaacatttccttggcctgatgctaatggaaaggcacatttcaatcttaca gatccagagagagcaagtgagtctgccctggcaatgacagacatttcaagtctagcaagt caaaacaagacctttgatcccaagaaagaaaatcccatcgtgttacttagcgacttttac tatggacagcataaaggagatgggcagccggaacagaagactcacaccacctttaaatgc ctcagctgcgtgaaagttctaaaaaatattaagtttatgaatcacatgaagcatcatttg gaatttgagaagcagaggaacgacagctgggaagaccacaccacctgccagcactgccac cggcagtttcccactcccttccagctacagtgtcacattgatagtgtacacatcgccatg gggccctctgctgtctgtaaaatctgtgaattgtcatttgaaacagatcaggtcctctta caacacatgaaggaccatcataagcctggcgaaatgccttatgtgtgccaggtttgccat tacagatcgtcggtctttgctgatgtggaaacacattttagaacgtgccatgaaaacaca aagaatttgctttgtctgttttgtctcaaacttttcaaaactgcaataccatacatgaat cattgttggaggcacagcagaaggagggtccttcagtgttccaagtgccggctacagttt ttgacgttgaaggaggaaatagagcacaaaaccaaggaccatcaaacatttaaaaagccg gagcaactgcaagggttgcctagtgaaacaaaagttattattcaaacttcagttcagcca ggatcaagtggtatggcttccgttattgttagcaacactgaccctcagtcttctcctgta aaaactaaaaagaagacggctatgaacactagagattccagactcccttgcagcaaggat tctagctga >gi568815576r:22414005_22615630|GENSCAN_predicted_peptide_6|604_aa MEHYAAIKIDEFMSFVGTWMKLETIILSKLSQGRKTKHRMFSLIGKDPPHLCRSLTIPLE GLGGSGFCIQSLLTLVPPAGPRREFRRGFRQLRGVVNSLRKNGSIQSRYISMSVWTSPRR LVELAGQSLLKDEALAIAALELLPRELFPPLFMAAFDGRHSQTLKAMVQAWPFTCLPLGV LMKGQHLHLETFKAVLDGLDVLLAQEVRPRRWKLQVLDLRKNSHQDFWTVWSGNRASLYS FPEPEAAQPMTKKRKVDGLSTEAEQPFIPVEVLVDLFLKEGACDELFSYLIEKVKRKKNV LRLCCKKLKIFAMPMQDIKMILKMVQLDSIEDLEVTCTWKLPTLAKFSPYLGQMINLRRL LLSHIHASSYISPEKEEQYIAQFTSQFLSLQCLQALYVDSLFFLRGRLDQLLRHVMNPLE TLSITNCRLSEGDVMHLSQSPSVSQLSVLSLSGVMLTDVSPEPLQALLERASATLQDLVF DECGITDDQLLALLPSLSHCSQLTTLSFYGNSISISALQSLLQHLIGLSNLTHVLYPVPL ESYEDIHGTLHLERLAYLHARLRELLCELGRPSMVWLSANPCPHCGDRTFYDPEPILCPC FMPN >gi568815576r:22414005_22615630|GENSCAN_predicted_CDS_6|1815_bp atggaacactatgcagccataaaaattgatgagttcatgtcctttgtagggacatggatg aagctggaaaccatcattctcagcaaactgtcacaaggacgaaaaaccaaacaccgcatg ttctcactcataggaaaggatcctccccatctctgcagaagcctgaccatccccctagag ggcctgggaggaagtgggttttgcatacagtccctgttgactctagtgccccctgctggc cccagacgcgagttccggcgaggcttcaggcaacttcgcggtgtggtgaactctctgagg aaaaacggttccattcagagccgatacatcagcatgagtgtgtggacaagcccacggaga cttgtggagctggcagggcagagcctgctgaaggatgaggccctggccattgccgccctg gagttgctgcccagggagctcttcccgccactcttcatggcagcctttgacgggagacac agccagaccctgaaggcaatggtgcaggcctggcccttcacctgcctccctctgggagtg ctgatgaagggacaacatcttcacctggagaccttcaaagctgtgcttgatggacttgat gtgctccttgcccaggaggttcgccccaggaggtggaaacttcaagtgctggatttacgg aagaactctcatcaggacttctggactgtatggtctggaaacagggccagtctgtactca tttccagagccagaagcagctcagcccatgacaaagaagcgaaaagtagatggtttgagc acagaggcagagcagcccttcattccagtagaggtgctcgtagacctgttcctcaaggaa ggtgcctgtgatgaattgttctcctacctcattgagaaagtgaagcgaaagaaaaatgta ctacgcctgtgctgtaagaagctgaagatttttgcaatgcccatgcaggatatcaagatg atcctgaaaatggtgcagctggactctattgaagatttggaagtgacttgtacctggaag ctacccaccttggcgaaattttctccttacctgggccagatgattaatctgcgtagactc ctcctctcccacatccatgcatcttcctacatttccccggagaaggaagagcagtatatc gcccagttcacctctcagttcctcagtctgcagtgcctgcaggctctctatgtggactct ttatttttccttagaggccgcctggatcagttgctcaggcacgtgatgaaccccttggaa accctctcaataactaactgccggctttcggaaggggatgtgatgcatctgtcccagagt cccagcgtcagtcagctaagtgtcctgagtctaagtggggtcatgctgaccgatgtaagt cccgagcccctccaagctctgctggagagagcctctgccaccctccaggacctggtcttt gatgagtgtgggatcacggatgatcagctccttgccctcctgccttccctgagccactgc tcccagcttacgaccttaagcttctacgggaattccatctccatatctgccctgcagagt ctcctgcagcacctcatcgggctgagcaatctgacccacgtgctgtatcctgtccccctg gagagttatgaggacatccatggtaccctccacctggagaggcttgcctatctgcatgcc aggctcagggagttgctgtgtgagttggggcggcccagcatggtctggcttagtgccaac ccctgtcctcactgtggggacagaaccttctatgacccggagcccatcctgtgcccctgt ttcatgcctaattag >gi568815576r:22414005_22615630|GENSCAN_predicted_peptide_7|232_aa MAWALLLLTLLTQGTGSWAQSALTQPPFVSGAPGQSVTISCTGTSSDVGDYDHVFWYQKR LSTTSRLLIYNVNTRPSGISDLFSGSKSGNMASLTISGLKSEVPAVSVALGQMARITCQG DSMEGSYEHWYQQKPGQAPVLVIYDSSDRPSRIPERFSGSKSGNTTTLTITGAQAEDEAD YYYQLIDNHATQLTVTQADGKWYQYGPHDGSSIPANLGLHTADCGWSQNPKC >gi568815576r:22414005_22615630|GENSCAN_predicted_CDS_7|699_bp atggcctgggctctgctcctcctcaccctcctcactcagggcacagggtcctgggcccaa tctgccctgactcagcctccttttgtgtccggggctcctggacagtcggtcaccatctcc tgcactggaaccagcagtgacgttggggattatgatcatgtcttctggtaccaaaagcgt ctcagcactacctccagactcctgatttacaatgtcaatactcggccttcagggatctct gacctcttctcaggctccaagtctggcaacatggcttccctgaccatctctgggctcaag tccgaggtgcctgcagtgtctgtggccttgggacaaatggccaggatcacctgccaggga gacagcatggaaggctcttatgaacactggtaccagcagaagccaggccaggcccccgtg ctggtcatctatgatagcagtgaccggccctcaaggatccctgagcgattctctggctcc aaatcaggcaacacaaccaccctgaccatcactggggcccaggctgaggatgaggctgat tattactatcagttgatagacaaccatgctactcaactcacagtgacacaggcagatgga aaatggtaccagtacggtcctcatgacggttcatctatacctgccaatctaggcctccac actgccgactgtggatggtcacagaacccaaaatgctaa