GENSCAN 1.0	Date run: 8-Nov-116	Time: 09:37:28

Sequence gi568815590r:99787508_99992021 : 204514 bp : 40.10% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4829   4882   54  1  0   84   67    60 0.493   4.83
 1.02 Intr +  21868  22023  156  0  0  104  111   155 0.999  18.69
 1.03 Intr +  30033  30296  264  2  0   77   36   189 0.929   9.39
 1.04 Intr +  30944  31027   84  1  0   67  101    28 0.559   1.10
 1.05 Intr +  31206  31381  176  2  2   65   74     6 0.544  -5.28
 1.06 Intr +  31905  32075  171  0  0   53   90   140 0.690   8.84
 1.07 Intr +  32414  32615  202  2  1   53   79   146 0.998   8.57
 1.08 Intr +  33787  33975  189  0  0   73  116   172 0.998  17.36
 1.09 Intr +  36089  36207  119  1  2   52   93    45 0.243  -0.26
 1.10 Intr +  42779  43037  259  2  1   73   27   208 0.432   9.74
 1.11 Intr +  44862  45145  284  2  2  105   68   127 0.932   7.69
 1.12 Intr +  47690  47817  128  2  2  101   95    65 0.997   7.90
 1.13 Intr +  48032  48235  204  0  0   71   44   131 0.695   5.25
 1.14 Intr +  56052  56286  235  1  1  100  -29   165 0.336   1.82
 1.15 Intr +  58040  58148  109  2  1  106   27   115 0.154   6.57
 1.16 Intr +  61269  61387  119  2  2   91   96   115 0.951  10.94
 1.17 Intr +  65944  66749  806  1  2  120  110   599 0.962  55.46
 1.18 Intr +  69906  70045  140  1  2   49   90    33 0.850  -1.14
 1.19 Intr +  70585  70785  201  0  0   99   86   122 0.875  11.66
 1.20 Intr +  71333  71486  154  1  1   38   36   128 0.579   1.32
 1.21 Intr +  71797  71973  177  2  0  121   69   146 0.999  14.97
 1.22 Intr +  74269  74439  171  2  0  122   87   205 0.999  22.79
 1.23 Intr +  75953  76206  254  0  2   81  114   107 0.538   8.83
 1.24 Intr +  78982  79167  186  0  0   52   27   134 0.613   2.66
 1.25 Intr +  79859  79950   92  1  2   79   83    74 0.767   3.87
 1.26 Intr +  80717  80958  242  2  2   59  115   273 0.803  23.37
 1.27 Intr +  83278  83380  103  2  1  108   58    80 0.995   5.31
 1.28 Intr +  83941  84190  250  1  1  108   94   299 0.986  29.02
 1.29 Term +  87911  88159  249  1  0   71   47   110 0.343  -0.08
 1.30 PlyA +  89899  89904    6                               1.05

 2.06 PlyA -  90381  90376    6                               1.05
 2.05 Term - 100111  99998  114  1  0   44   33   103 0.439  -2.01
 2.04 Intr - 104545 104401  145  0  1   72   95   124 0.857  10.86
 2.03 Intr - 106044 105723  322  1  1   34   44   249 0.375   9.09
 2.02 Intr - 106766 106606  161  1  2   67   80    59 0.811   1.71
 2.01 Init - 114544 114291  254  1  2   55   91   195 0.908  13.36
 2.00 Prom - 125295 125256   40                              -2.95

 3.03 PlyA - 125389 125384    6                               1.05
 3.02 Term - 130997 130927   71  0  2  102   43    29 0.185  -3.08
 3.01 Init - 140340 140220  121  0  1   68   74   110 0.439   8.00
 3.00 Prom - 147027 146988   40                              -5.25

 4.07 PlyA - 148501 148496    6                               1.05
 4.06 Term - 154865 154751  115  1  1   70   42    61 0.121  -3.24
 4.05 Intr - 158864 158746  119  2  2   39  109   120 0.260   7.54
 4.04 Intr - 177923 177828   96  2  0   45  115    60 0.024   3.69
 4.03 Intr - 190529 190410  120  2  0   43  100    59 0.308   2.37
 4.02 Intr - 194609 194430  180  2  0   98  110   113 0.988  13.64
 4.01 Init - 195989 195951   39  2  0   70   73    17 0.587  -1.36
 4.00 Prom - 196095 196056   40                              -1.65

 5.00 Prom + 201536 201575   40                              -6.65
 5.01 Sngl + 203746 204012  267  0  0   79   38   266 0.695  15.69
 5.02 PlyA + 204032 204037    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815590r:99787508_99992021|GENSCAN_predicted_peptide_1|1925_aa MTQMLALSDMYSKVSVSKLLHICIEGWGNWRWSEPFSVDHAGTFIRTIQYRGRTASLIIK VQQLNGVQKQIIICGRQIICSYLSQSIELKVVQHYIGQDGQAVVREHFDCLTAKQKLPSY ILENNELTELCVKAKGDEDWSRDVCLESKAPEYSIVIQVPSSNSSIIYVWCTVLTLEPNS QVQQRMIVFSPLFIMRSHLPDPIIIHLEKRSLGLSETQIIPGKGQEKPLQNIEPDLVHHL TFQAREEYDPSDCAVPISTSLIKQIATKVHPGGTVNQILDEFYGPEKSLQPIWPYNKKDS DRNEQLSQWDSPMRVKLSIWKPYVRTLLIELLPWALLINESKWDLWLFEGEKIVLQVPAG KIIIPPNFQEAFQIGIYWANTNTVHKSVAIKLVHNLTSPKWKDGGNGEVVTLDEEAFVDT EIRLGAFPGHQKNSSKSNRAAKAPCCRLNTDRGQNISENERNAIKYSGEKQRDALLREEE SREAFWLQRLCGAVVGSSQSELSGGFVYTVREKPPTQTSVMVDAPPPTKLEHPKSTSDCC AGSENFKPVDLSLLGSLGYFRVPDSATFSICPGGEQPAMKSSSLPCWDLMPDISQSVLDA SLLQKQIMLGFSPAPGADSSQCWSLPAIVRPEFPRQSVAVPLGNFRENGFCTRAIVLTYQ EHLGVTYLTLSEDPSPRVIIHNRCPVKMLIKENIKDIPKFEVYCKKIPSECSIHHELYHQ ISSYPDCKTKDLLPSLLLRVEPLDEVTTEWSDAIDINSQGTQVRQGNSNNPSTVEGGRMR DIRQSLAYGSSECFLGTYCGVYTLAARNQLSLSTVSNPEGALTIVLQVPGSTFWEIFLGH LKRLAQGGKQENPNKQAVSSFLLADVMLAEESRIATPNVVFLTGFGYVYVDVVHQCGTVF ITVAPEGKAGPILTNTNRAPEKIVTFKMFITQLSLAVFDDLTHHKASAELLRLTLDNIFL CVAPGAGPLPGEEPVAALFELYCVEICCGDLQLDNQLYNKSNFHFAVLVCQGEKAEPIQC SKMQSLLISNKELEEYKEKCFIKLCITLNEGKSILCDINEFSFELKPARLYVEDTFVYYI KTLFDTYLPNSRLAGHSTHLSGGKQVLPMQVTQHARALVNPVKLRKLVIQPVNLLVSIHA SLKLYIASDHTPLSFSVFERGPIFTTARQLVHALAMHYAAGALFRAGTPLQPFRAKGNLL FLYLQRRPSMEEYSTASPAPSPTLCSLSSQKKESLLVKEGWLPPSPVPTAFPLPGSEHQA GEVSGFPEKAVFSDSSSPGPSLAPPILGAYSPKSPPQRARKFASPLRHAIEHLAAPGQAS PPILVPPLSRGGFRENLPPANTCNPRRRPSSGWVVGSLDILGSPASLVRSIGNGVADFFR LPYEGLTRGPGAFVSGVSRGTTSFVKHISKGTLTSITNLATSLARNMDRLSLDEEHYNRQ EEWRRQLPESLGEGLRQGLSRLGISLLALIGASRQRERERPTPGLAGVWLSPPQPPTPFL PEGSSLFPVRETPANAPFSVCKIVPDGKNQLCEESQISPDEVIFLGVGPVDRNLTPCHGT QAFHFWGAAQIVLSVKATVTEAFTVARENGRAEQRGPHLIPACAGAGGCQHPVRLPQAQC RRWMPCATAVGAELQGVLVAILSEEWSLSFSEDFKPLSLLRAFVIPGAIAGIVDQPMQNF QKTSEAQASAGHKAKGVISGVGKGIMGVFTKPIGGAAELVSQTGYGILHGAGLSQLPKQR 
HQPSDLHADQAPNSHVKYVWKMLQSLGRPEVHMALDVVLVRGSGQEHEGCLLLTSEVLFV VSVSEDTQQQAFPVTEIDCAQDSKQNNLLTVQLKQPRVACDVEVDGVRERLSEQQYNRLV DYITKTSCHLAPSCSSMQIPCPVVAAEPPPSTVKTYHYLVDPHFAQVFLSKFTMVKNKAL RKGFP >gi568815590r:99787508_99992021|GENSCAN_predicted_CDS_1|5778_bp atgacccagatgttggcattatcggacatgtactctaaagtgtctgtgtcaaagctgtta cacatctgtattgaaggttggggcaactggcgttggtcagagcctttcagtgtggaccat gccgggacttttattagaacaattcagtacaggggtcgaactgcttctctcatcatcaag gttcagcaactcaatggagtacaaaaacagattatcatctgtggaagacagatcatctgt agttacttgtctcaaagcatagaactaaaagtcgttcagcattacattggtcaagatgga caagctgtagttcgggaacattttgactgcctcacagccaaacagaaattgccttcgtac atactagaaaacaatgaactgacggagctgtgtgtgaaggccaaaggagatgaagactgg tcaagagatgtgtgcctggaatccaaagcccctgagtacagcattgtcattcaggtgcca tcttcaaacagttccattatttatgtctggtgcacagttttgactttagaacccaactct caagtgcaacaacgaatgattgtgttcagccctctttttatcatgaggagtcatcttcca gaccccattatcatacatttggagaaaaggagtctgggattgagtgaaacacaaattatt ccaggaaaagggcaggaaaaaccactgcaaaacatagaacctgaccttgtacatcacctg acattccaagcaagagaagaatatgatccttcagattgtgcagttcccatctcaacatcc ctcattaagcaaatagccactaaggtacaccctggaggcacagttaatcagatccttgac gaattctatgggccagaaaagtcgcttcaacccatatggccctataataagaaggattct gacaggaatgaacagctaagtcagtgggatagcccaatgcgagtgaagctgtcaatctgg aagccatatgttagaactttgttgatagaacttctgccctgggccctgcttatcaatgaa tccaaatgggacctctggctatttgaaggagagaaaattgttctacaggttcctgctggc aaaattattattcctcctaattttcaggaagcttttcaaattggaatatactgggcaaat acaaacactgtgcacaagtcagtagcaattaaactggtccataacctgacatctccaaag tggaaagatggaggtaatggtgaagttgtgacactggatgaagaagcgtttgttgatact gaaataagacttggtgcttttccaggacatcagaagaacagcagcaaaagtaacagggct gctaaagcaccatgttgtaggctaaatactgacaggggccagaacatttctgagaatgaa aggaacgccattaaatattctggggaaaaacaaagagatgccctgctcagagaggaggaa tctagagaggcattctggctacagcggctttgcggggctgtggtgggctcttcccagtcc gaactttctggaggctttgtttacactgtgagggagaaaccacctactcaaacctcagta atggtggatgcccctcctcccaccaagctggagcatcccaagtcaacttcagactgctgt gctggcagtgagaatttcaagccagtggatcttagcctgctgggctccctggggtatttt 
cgtgttccagacagtgctacttttagcatttgcccaggtggagagcagcctgctatgaaa tccagctcccttccttgctgggacttgatgcctgacatcagtcagtcagtactggatgca tccctgcttcagaaacagatcatgctgggcttttctcctgccccaggtgctgacagctca cagtgctggagcctgccagctatagttagaccagagtttcccagacagagtgtggcagta cccctcgggaatttccgggaaaatggattctgtaccagggctatagtgctgacatatcaa gaacacctcggagtgacttatttaaccctctcagaagaccctagtcctcgagtaattatc cacaatagatgtccagtaaaaatgcttataaaggaaaacattaaagatattccaaagttt gaggtttattgcaaaaaaattccctccgagtgctcaattcatcatgagctgtatcatcag atttccagttatccggactgcaagaccaaagacttacttccaagcctacttttgagagtt gaacctctagatgaagtaacaactgagtggagtgatgccattgacatcaacagtcaggga acacaggtcagacagggtaacagtaataacccctccactgtggaagggggaagaatgaga gacatacggcagtctcttgcctatggcagttctgaatgttttctgggcacatattgtggg gtctataccctggcagcaagaaatcagctttcactgagcactgtttctaatcctgaagga gctcttaccattgtccttcaggttcctggctccaccttctgggagatcttccttggacac ttgaagaggctagcccagggtggcaagcaggaaaaccccaataagcaagctgtatcaagc ttcttacttgctgatgtcatgttagctgaagaaagtcgcatagccacgcccaatgttgtg ttcctgactggctttggctatgtgtatgtggatgttgtacatcagtgtggcacagtcttc atcactgtggccccagaaggaaaagcaggacctattttaaccaataccaacagagcgcca gagaagattgttacatttaaaatgttcatcactcagttaagcctggcagtgtttgatgac ctcacccaccacaaagcatcagctgagcttctgagactcacactggacaacatttttctc tgtgtggccccgggagctggtcccctccctggggaagagcctgtggctgcgttgtttgaa ctttactgtgtggagatctgctgtggggacctgcagctagacaaccagctttataacaag tccaatttccactttgctgtcttagtctgccagggagaaaaagcagaacccattcagtgt tccaaaatgcagagtctcctcatatccaacaaagagttggaagaatacaaggaaaaatgt tttatcaaactttgcatcaccttaaatgaaggcaagagcatcctctgtgatattaatgag ttcagctttgaattaaaacctgctcggttatacgtggaagacacatttgtatactacatc aagactttgtttgacacctaccttcctaacagcaggttggctggtcactccacacacctc tccgggggtaaacaggtgttgcccatgcaggtcacacagcacgccagggccttggtgaat cctgtgaagttacggaaactggtgatccagccagtaaatttgctcgtcagcatccacgct tccctcaagctgtacatagcctcagaccacactcctctctccttctcggtgtttgaaaga ggacccatcttcaccactgcgaggcagcttgtgcacgccctggcaatgcactatgccgct ggggccctttttagagcaggtacacccttacagccattcagggctaaagggaatctgctg 
tttctttacctccagagaaggccctccatggaggaatattccacagcctccccagctccc tcgcctactctttgcagcctgagctcacagaagaaagagagcctgctggtgaaggaggga tggctcccacccagtcctgtccccacagccttcccattaccgggctctgagcaccaggct ggggaggtcagtggcttcccggagaaagctgtcttttcagactcctcctctcctgggccc tcacttgcccctccaattcttggggcctacagccccaaatccccaccccagagagccagg aagtttgcatcccctttgcgacatgccattgagcatctggccgcccctggccaggcatct cctcctatcctggtgcctccgctgtcccggggaggctttagagagaacctcccacctgca aacacctgcaacccaagaaggcgcccttcctcaggctgggtagttgggtctctggatatt cttggcagccctgcaagcctggtgagaagcatcgggaacggggtcgccgacttcttcagg cttccgtatgaggggctgacccggggccctggagccttcgtgagtggcgtctccagaggg accacatcgtttgtaaagcacatctccaaaggtaccctcacatccatcaccaacctcgcc acaagcctggcccggaacatggaccggctctcactggatgaggagcactacaaccggcag gaggagtggcggcggcagctccccgagagcctgggcgaggggcttcgacagggcctgtcc cggctgggcatcagcctgcttgcccttataggagcttctaggcaaagagagagagagaga cccacccctgggctggctggtgtgtggctgtctccaccccaacccccgactcccttcctg cccgagggttccagtctcttcccagtcagagaaaccccagccaatgctccgttttcagtg tgtaagattgtacctgatgggaaaaatcagttgtgtgaggagtcacaaatcagccctgat gaggtcatctttctgggcgtgggtcctgttgaccggaatctaacaccttgtcatgggact caggctttccacttttggggggctgcccaaatagtcttatctgtcaaagccacagtaacc gaagccttcactgtggccagagaaaacggaagagcagaacagagggggcctcacctgata cctgcatgtgctggagcaggtggctgtcagcaccctgtgcggctgcctcaggctcagtgc aggagatggatgccctgtgctacagctgtgggagcagagctccaaggggttttggtcgct attctaagtgaggaatggagcctttcattttccgaggattttaagcctctgtccttactg agggcttttgttattccaggtgcaattgctggtatagttgatcagccgatgcagaacttc cagaaaacatctgaggcacaggcttcagcaggacacaaggccaagggtgtcatctcgggt gtggggaaaggaatcatgggggtgttcacaaagcccatcggaggagctgctgagctggtg tcacagactggctatggtattttacatggagctggactttctcagcttcccaaacagcgc catcagccaagtgatctacatgctgaccaggctccaaacagccatgtcaaatatgtctgg aaaatgcttcagtctctgggcagaccagaagtccacatggccctggacgtggttctggtg aggggctcaggccaggagcatgaagggtgcttgctgctgacatcagaagtgctcttcgtg gtgagtgtcagtgaggacacacagcagcaggccttccccgtcacagaaatcgactgtgca caggacagcaagcagaacaacttactcacagtgcagctcaagcagccaagagtggcctgt 
gatgtggaggtagatggagtccgagagagactgtcagagcaacagtacaacagactggtg gactacatcacaaagacatcttgtcacctggcccccagctgttcttccatgcaaatacca tgccctgtggtggctgcagaacctcccccctccactgttaaaacataccattacctggtt gatccacattttgctcaggtcttccttagtaaatttaccatggtgaaaaataaagccctg aggaaagggtttccttga >gi568815590r:99787508_99992021|GENSCAN_predicted_peptide_2|331_aa MKLDPFLTPYTKINPRWIKDLNVRPKTVKILEENLDNAIQDIGMGKDFMSKTPKAMATKA KIDKWDLIKLKSFCTAKETIIRVNRLFQYKQPGIIKAQEPHPSTPLSTNFQLPSFKILVF LRNPQKYRPSIKTSSALPVVTVCLRRGAASGLSRIYSLTLPLRPEGGRRADHGPGWVATA PPKPGGLRDPREPSWLRSELARGWYPLRFFTAIVTLVRSFDLSGPGVSPVRKGVVVSNAA GREAQGLAYVSRTVTTMAPEVLPKPRMRGLLARRLRNHMAVAFVLSLGVAALYKFRVADQ RKKAYADFYRNYDVMKDFEEMRKAGIFQSVK >gi568815590r:99787508_99992021|GENSCAN_predicted_CDS_2|996_bp atgaaactggaccccttccttacaccttatacaaaaattaacccaaggtggattaaagac ttaaacgtaagacctaaaaccgtaaaaatcctagaagaaaacctggacaatgccattcag gacataggcatgggcaaagacttcatgtctaaaacaccaaaagcaatggcaacaaaagcc aaaattgacaaatgggatcttattaaactaaagagcttctgcacagcaaaagaaactatc atcagagtgaacaggttatttcaatacaaacagcctggcatcattaaggcacaagagcct cacccttctacaccactttccacgaacttccagttgccttccttcaaaatccttgttttc ttaagaaacccacaaaaataccgcccctctatcaaaacgtcctcagctttgccagtcgtg actgtgtgtctccgccgtggtgcagcttcaggcctctcccgcatctactctctcacgctt ccgctgcggcctgagggagggcggcgggcggaccacggaccggggtgggttgcgacggcc ccaccgaagccgggtgggctgcgggaccctcgagaacccagctggcttcggagtgagctg gcccggggctggtacccgctgcgctttttcacagccattgtgaccttggtgaggtcattt gacctatctggccccggtgtttctcccgtgagaaaaggggtggtggtttctaatgcagct ggtcgtgaggcgcaagggttagcatacgtatcaaggacagtaactaccatggctcccgaa gttttgccaaaacctcggatgcgtggccttctggccaggcgtctgcgaaatcatatggct gtagcattcgtgctatccctgggggttgcagctttgtataagtttcgtgtggctgatcaa agaaagaaggcatacgcagatttctacagaaactacgatgtcatgaaagattttgaggag atgaggaaggctggtatctttcagagtgtaaagtaa >gi568815590r:99787508_99992021|GENSCAN_predicted_peptide_3|63_aa MTLNEHAAFKHLFNKAHLAPPLIHLTLSGHSTCFREHRVGETSAPCVAQDGEAFHYNHSV LQP >gi568815590r:99787508_99992021|GENSCAN_predicted_CDS_3|192_bp atgactcttaacgagcatgctgccttcaagcatctgtttaacaaagcacatcttgcaccg 
cccttaatccatttaaccctgagtggacacagcacgtgtttcagagagcaccgggttggg gaaaccagtgctccctgtgttgcccaggatggagaggcttttcactacaatcatagtgta ctgcagccttga >gi568815590r:99787508_99992021|GENSCAN_predicted_peptide_4|222_aa MAIIKRSENNSCLDLCHSHCDESVIQKKITTIINCFINSSIPPALQIDIPVEQAQKIIEH RKELGPYVFREAQFCEFRKNLTDENIMSVLERRQEYNKQKKKLAVLEDEKSGKDGIKQYA NTSVPAIKTALLSDSFLGLQPYGRQGASKVIGQCQSSAAKPRRSGKESVREPWARVPGAL GVAARSHKAIDDCSVVSYGSSINPNMFTLFSFQLLQITIPRD >gi568815590r:99787508_99992021|GENSCAN_predicted_CDS_4|669_bp atggctattattaaacggtcagaaaataacagctgcttggacttgtgccattctcattgt gatgagtctgtcatccagaagaagattacaactattatcaactgctttattaattccagt attccaccagctttacaaattgacattccagtagagcaagcccagaagattattgaacac cggaaggagttaggaccatatgtatttagagaggcacagttctgtgagtttaggaagaat ttaacagatgaaaatattatgagtgttttagagagaagacaagaatataataagcagaaa aaaaaattggcagtcctagaagacgaaaaatctggaaaggatggaatcaaacaatatgca aatacttcagtgcctgctatcaaaactgctttactcagtgattccttcctaggcctccaa ccatatggccgacagggggcttccaaggtgatcgggcagtgtcagtcttcagccgctaag ccgagaagatctgggaaggagtcagtcagagagccttgggccagagttccaggggctctg ggagtggctgccagatcacataaagccatagatgattgttcagtagtgagctatgggtcc tccatcaacccaaatatgttcactctcttttcatttcagttattacagataacaatacca agagattga >gi568815590r:99787508_99992021|GENSCAN_predicted_peptide_5|88_aa MDNDFDKLTEVGFRRLVITNFSELKEDVGTHRKEAKNLEKRLDEWLTRINSVEKILNDLM ELKTMARELRDACTSFNSQFDQMEEGYQ >gi568815590r:99787508_99992021|GENSCAN_predicted_CDS_5|267_bp atggacaatgactttgacaagctgacagaagtaggcttcagaaggttggtaataacaaat ttctctgagctaaaggaggatgttggaacccatcgcaaggaagctaaaaaccttgaaaaa agattagatgaatggctaactagaataaacagtgtagagaagatcttaaatgacctgatg gagctgaaaaccatggcacgagaacttcgcgacgcatgcacaagcttcaatagccaattc gatcaaatggaagaagggtatcagtga