GENSCAN 1.0 Date run: 5-Nov-116 Time: 02:12:04 Sequence gi568815588f:110398122_110610423 : 212302 bp : 44.58% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 295 491 197 2 2 39 96 100 0.545 5.03 1.02 Intr + 10826 10941 116 1 2 88 89 7 0.268 0.05 1.03 Intr + 13063 13378 316 1 1 114 11 144 0.222 5.37 1.04 Intr + 52958 53072 115 1 1 36 67 194 0.914 12.12 1.05 Intr + 61831 61856 26 2 2 67 97 17 0.090 -1.76 1.06 Term + 64383 64529 147 2 0 69 43 100 0.092 1.50 1.07 PlyA + 65302 65307 6 1.05 2.02 PlyA - 66188 66183 6 1.05 2.01 Sngl - 72285 71554 732 0 0 82 47 227 0.715 14.23 2.00 Prom - 81568 81529 40 -6.26 3.00 Prom + 82169 82208 40 -5.46 3.01 Init + 94576 94589 14 0 2 99 96 2 0.389 1.82 3.02 Intr + 99673 100379 707 1 2 105 105 948 0.969 89.30 3.03 Intr + 104600 104748 149 0 2 100 98 172 0.977 19.35 3.04 Intr + 108814 109033 220 0 1 81 74 239 0.984 19.67 3.05 Term + 111899 112305 407 0 2 123 41 261 0.995 20.25 3.06 PlyA + 112617 112622 6 1.05 4.08 PlyA - 113503 113498 6 1.05 4.07 Term - 133596 133571 26 1 2 129 54 -4 0.055 -1.31 4.06 Intr - 141540 141444 97 0 1 74 86 37 0.289 1.68 4.05 Intr - 145117 144963 155 2 2 43 102 59 0.297 2.59 4.04 Intr - 146303 146213 91 2 1 29 94 51 0.189 -0.63 4.03 Intr - 153397 153328 70 0 1 132 75 26 0.129 4.88 4.02 Intr - 153952 153866 87 0 0 108 66 40 0.125 2.89 4.01 Init - 161022 160976 47 0 2 77 81 13 0.223 -0.30 4.00 Prom - 163589 163550 40 -4.76 5.00 Prom + 165055 165094 40 -3.06 5.01 Init + 169696 169710 15 0 0 71 110 -8 0.578 0.05 5.02 Intr + 170817 170892 76 2 1 79 111 40 0.834 4.49 5.03 Intr + 177215 177282 68 0 2 61 84 42 0.838 -0.28 5.04 Intr + 179300 179371 72 1 0 102 100 69 0.997 9.10 5.05 Intr + 179714 179793 80 1 2 57 87 107 0.909 5.85 5.06 Intr + 180507 180585 79 0 1 42 93 42 0.880 -0.35 5.07 Intr + 182783 182900 118 1 1 55 98 59 0.945 3.64 5.08 Intr + 183802 183977 176 2 2 69 83 127 0.997 9.96 5.09 Intr + 184441 184521 81 0 0 47 94 82 0.952 4.53 5.10 Intr + 185263 185427 165 0 0 54 110 136 0.999 12.56 5.11 Intr + 185720 185841 122 1 2 78 13 119 0.998 2.69 5.12 Intr + 186062 186275 214 2 1 64 64 170 0.998 10.92 5.13 Intr + 191484 191577 94 2 1 89 1 112 0.763 2.14 5.14 Intr + 191771 191870 100 0 1 62 86 128 0.763 9.17 5.15 Intr + 192870 193011 142 0 1 54 82 47 0.862 1.06 5.16 Intr + 194952 195102 151 2 1 75 76 80 0.892 5.24 5.17 Term + 198277 198452 176 2 2 77 40 87 0.815 0.72 5.18 PlyA + 198664 198669 6 1.05 6.00 Prom + 198847 198886 40 -2.86 6.01 Init + 200116 200169 54 0 0 67 83 59 0.554 4.58 6.02 Intr + 201548 201691 144 1 0 -10 111 113 0.577 4.28 6.03 Intr + 202901 203009 109 1 1 83 63 45 0.975 1.36 6.04 Intr + 203516 203763 248 0 2 64 77 240 0.999 17.68 6.05 Intr + 203845 204057 213 0 0 72 91 193 0.999 16.91 6.06 Intr + 204353 204470 118 1 1 70 69 112 0.865 7.54 6.07 Intr + 204717 204881 165 1 0 80 92 75 0.921 7.03 6.08 Intr + 205063 205169 107 2 2 61 86 64 0.908 3.43 6.09 Term + 206110 206181 72 0 0 82 32 67 0.868 -1.59 6.10 PlyA + 207416 207421 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 126617 126553 65 2 2 72 110 35 0.851 4.72 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:110398122_110610423|GENSCAN_predicted_peptide_1|305_aa XALGEIFSGHQDSLPKLESLYLWARAMEVLEPSRKKSSKGFGSDLFQCASPQAKKGFSLQ PGDGPQATNVPTQGAGMLLSSLIWDLTGLAPESAQVISGSPSGVFKLSSGLRLVGPKPRV DSFILSGLNQAASGGTPGDPPALFGRRGGSQIRNHPGTMGPSYGSPHRWHHSPESGEYEL REALSKVSKEENKMKELPYRALEEDKYLQVAAKLKEKYDKDVADSKSKGKFDGAKCPAEV ARKKVEEEGTPGNAYTKMIDTKKAHVLRALTFNGEGSEPKQINKAIAKTEKCHEENASWW CGTEG >gi568815588f:110398122_110610423|GENSCAN_predicted_CDS_1|918_bp naagccctgggagagatattctccgggcaccaggactcactgcccaagctcgagtcgctg tacctctgggccagggcaatggaggtcctggaaccttccaggaaaaagagttcaaaaggt tttggcagtgaccttttccagtgtgcttccccccaggcaaagaaaggtttttcactgcag cctggggatgggcctcaggctactaatgttcccacacagggtgcagggatgctgctttcc tctctgatatgggacctcactggactcgctcccgagtctgcccaagtcatctctggaagt ccctcgggtgtcttcaagctttcttcaggcctgcggctggtgggccccaagccaagagtg gattctttcatactctctggtctaaatcaagctgcatcaggtggcactcccggtgaccct cctgccttatttggaagaagaggaggaagccagataagaaaccatccaggcaccatgggc ccttcttatgggtcaccccacagatggcatcattctccagagagtggggaatatgagctc cgagaggccctgagcaaggtgtcaaaggaagagaataaaatgaaggagctgccttatagg gccctagaggaggacaaatatttgcaagtggcggcaaagctgaaggagaagtatgataag gatgttgccgactctaagtcgaaaggaaagtttgatggtgcaaagtgtcctgctgaagtt gcccggaaaaaggtggaagaggaaggaaccccaggaaatgcatacaccaagatgatcgat accaaaaaggcccatgtgctcagggcattgacattcaatggagagggctcagagcccaaa caaatcaacaaggcaatcgccaagactgagaaatgccatgaagaaaatgcatcctggtgg tgtgggacggagggatag >gi568815588f:110398122_110610423|GENSCAN_predicted_peptide_2|243_aa MIVYLENPIVSAQNLLKLISNFSKVSEYKINVQKSQVFLYIINTQTESQIMSELPFTIAT KRIKYLGIQLTRDVKHLFKENYKPLLNKIKEDTNKWKNIPCSWIGRINIMKMAILPKVIY RFSAIPVKLPMTFFTELEKTTLKFIWNQKRARIAKTILSKKNKAGGITLPDFKLYYKATV TKTAWYSYQNRDIDQWNRTEASEITPHIYNHLIFDKPNKNKKWGKDSFLTNGAGKTGYPY LES >gi568815588f:110398122_110610423|GENSCAN_predicted_CDS_2|732_bp atgattgtatatttagaaaaccccatcgtctcagcccaaaatctccttaagctgataagc aacttcagcaaagtctcagaatacaaaatcaatgtgcaaaaatcacaagtattcctatac atcattaacacacaaacagagagccaaatcatgagtgaactcccattcacaattgctaca aagagaataaaatacctaggaatccaacttacaagggatgtgaagcacctcttcaaggag aactacaaaccactgctcaacaaaataaaagaggacacaaacaaatggaagaatattcca tgctcatggataggaagaatcaatatcatgaaaatggccatactgcccaaagtaatttat agattcagtgccatccccgtcaagctaccaatgactttcttcacagaattggaaaaaact actttaaagttcatatggaaccaaaaaagagcccgcattgctaagactatcctaagcaaa aagaacaaagctggaggcatcacgttacctgacttcaaactatactacaaggctacagta accaaaacagcatggtactcataccaaaacagagacatagaccaatggaacagaacagag gcctcagaaataacaccacacatctacaaccatctgatctttgacaaacctaacaaaaac aagaaatggggaaaggattcctttttaacaaatggtgctgggaaaactggctacccatat ctagaaagctga >gi568815588f:110398122_110610423|GENSCAN_predicted_peptide_3|498_aa MALRGAWPRPEGRGNTHIWPYMGSRVTDGTHSHKTLRGRRNPRLLGRRAAGLAIERAGRE RGVAPPLGRRAPSRPQPRGSPSRASPADTLAVDTLAVGTRGARGAGPLAGGGGGMKVTSL DGRQLRKMLRKEAAARCVVLDCRPYLAFAASNVRGSLNVNLNSVVLRRARGGAVSARYVL PDEAARARLLQEGGGGVAAVVVLDQGSRHWQKLREESAARVVLTSLLACLPAGPRVYFLK GGYETFYSEYPECCVDVKPISQEKIESERALISQCGKPVVNVSYRPAYDQGGPVEILPFL YLGSAYHASKCEFLANLHITALLNVSRRTSEACATHLHYKWIPVEDSHTADISSHFQEAI DFIDCVREKGGKVLVHCEAGISRSPTICMAYLMKTKQFRLKEAFDYIKQRRSMVSPNFGF MGQLLQYESEILPSTPNPQPPSCQGEAAGSSLIGHLQTLSPDMQGAYCTFPASVLAPVPT HSTVSELSRSPVATATSC >gi568815588f:110398122_110610423|GENSCAN_predicted_CDS_3|1497_bp atggctctaagaggggcgtggccacggcccgaggggcggggaaacacccatatttggcct tatatgggcagccgcgtcacggacggcactcattcacataaaacgctgcgcggccggcgg aatccccggcttctagggcggcgagcggccgggctggctatcgagcgagcggggcgggaa cgcggagttgcgccgccgctcgggcgccgggctccgtcgcggccgcagccccgcgggtcg ccctcccgtgcctcgcccgcggacaccctggccgtggacaccctggccgtgggcacccgc ggggcgcgcggcgcggggccgctggccggcggcggcggcggcatgaaggtcacgtcgctc gacgggcgccagctgcgcaagatgctccgcaaggaggcggcggcgcgctgcgtggtgctc gactgccggccctatctggccttcgctgcctcgaacgtgcgcggctcgctcaacgtcaac ctcaactcggtggtgctgcggcgggcccggggcggcgcggtgtcggcgcgctacgtgctg cccgacgaggcggcgcgcgcgcggctcctgcaggagggcggcggcggcgtcgcggccgtg gtggtgctggaccagggcagccgccactggcagaagctgcgagaggagagcgccgcgcgt gtcgtcctcacctcgctactcgcttgcctacccgccggcccgcgggtctacttcctcaaa gggggatatgagactttctactcggaatatcctgagtgttgcgtggatgtaaaacccatt tcacaagagaagattgagagtgagagagccctcatcagccagtgtggaaaaccagtggta aatgtcagctacaggccagcttatgaccagggtggcccagttgaaatccttcccttcctc taccttggaagtgcctaccatgcatccaagtgcgagttcctcgccaacctgcacatcaca gccctgctgaatgtctcccgacggacctccgaggcctgcgcgacccacctacactacaaa tggatccctgtggaagacagccacacggctgacattagctcccactttcaagaagcaata gacttcattgactgtgtcagggaaaagggaggcaaggtcctggtccactgtgaggctggg atctcccgttcacccaccatctgcatggcttaccttatgaagaccaagcagttccgcctg aaggaggccttcgattacatcaagcagaggaggagcatggtctcgcccaactttggcttc atgggccagctcctgcagtacgaatctgagatcctgccctccacgcccaacccccagcct ccctcctgccaaggggaggcagcaggctcttcactgataggccatttgcagacactgagc cctgacatgcagggtgcctactgcacattccctgcctcggtgctggcaccggtgcctacc cactcaacagtctcagagctcagcagaagccctgtggcaacggccacatcctgctaa >gi568815588f:110398122_110610423|GENSCAN_predicted_peptide_4|190_aa MKVFQQRRTTHLKALKALNLPSSLHLYIPEKEATQNLSALGQISRLGPEEDSEGWESVSG KSSAYLTLQNLEGIVKSFVSETHGLQIRKLRPEKVACPASFTLASSEGSGSFERSSIFGT YQQLDDAPAYVNPGHWEECSVSNAQVDQGKLVLIQSSLVLSCDSVAGFPDVAAGLVQDLQ GRGLILSCGS >gi568815588f:110398122_110610423|GENSCAN_predicted_CDS_4|573_bp atgaaggtatttcagcagaggagaaccactcacctgaaggcactgaaggcactgaatttg ccttcatccctgcatttgtacatcccagaaaaggaagccacacagaacttatcagccctg ggccagatatccaggcttggtcctgaggaagatagtgaaggctgggaatccgtgtctggg aagtcatcagcctaccttacactacaaaacctggaagggatcgtgaagagttttgtgtca gagactcatggtctgcagataaggaaactcaggccagagaaagtggcttgcccagcctcc ttcactctggcctccagtgaagggagtgggtcatttgaaagaagctctatctttggtaca taccagcagctggatgatgccccagcatatgttaacccagggcattgggaggaatgttca gtttccaatgctcaagttgaccaaggaaagttggtcctaatccagtcatccctggtgcta tcttgtgacagtgtggctggttttccagatgtagctgcagggctggtccaagacctccaa ggcaggggattgatcctttcctgtggaagctga >gi568815588f:110398122_110610423|GENSCAN_predicted_peptide_5|642_aa MYIKQVIIQGFRSYRDQTIVDPFSSKHNVIAIQFVLSDEFSHLRPEQRLALLHEGTGPRV ISAFVEIIFDNSDNRLPIDKEEVSLRRVIGAKKDQYFLDKKMVTKNDVMNLLESAGFSRS NPYYIVKQGKINQMATAPDSQRLKLLREVAGTRVYDERKEESISLMKETEGKREKINELL KYIEERLHTLEEEKEELAQYQKWDKMRRALEYTIYNQELNETRAKLDELSAKRETSGEKS RQLRDAQQDARDKMEDIERQVRELKTKISAMKEEKEQLSAERQEQIKQRTKLELKAKDLQ DELAGNSEQRKRLLKERQKLLEKIEEKQKELAETEPKFNSVKEKEERGIARLAQATQERT DLYAKQGRGSQFTSKEERDKWIKKELKSLDQAINDKKRQIAAIHKDLEDTEANKEKNLEQ YNKLDQDLNEVKARVEELDRKYYEVKNKKDELQTTCGEKRMQNSKHLLLKEKILKRSNNF LEQQQERLFYHIVDSDEVSTKILMEFNKMNLPGEVTFLPLNKLDVRDTAYPETNDAIPMI SKLRYNPRFDKAFKHVFGKTLICRSMEVSTQLARAFTMDCITLEGDQVSHRGALTGGYYD TRKSRLELQKDVRKAEEELGELEAKLNENLRRNIENIFLFCA >gi568815588f:110398122_110610423|GENSCAN_predicted_CDS_5|1929_bp atgtacataaagcaggtgattatccagggttttcgaagttacagagatcaaacaattgta gatcccttcagttcaaaacataatgtgattgcaattcagtttgttctcagtgatgagttt agtcatcttcgtccagaacagcggttggctttattgcatgaaggtactggtcctcgtgtt atttctgcttttgtggagattatttttgataattcagacaaccggttaccaatcgataaa gaggaagtttcacttcgaagagttattggtgccaaaaaggatcagtatttcttagacaag aagatggtcacgaaaaatgatgtgatgaacctccttgaaagcgctggtttttctcgaagc aatccttattatattgttaaacaaggaaagatcaaccagatggcaacagcaccagattct cagagattaaagctattaagagaagtagctggtactagagtgtatgacgaacgaaaggaa gaaagcatctccttaatgaaagaaacagagggcaaacgggaaaaaatcaatgagttgtta aaatacattgaagagagattacatactctagaggaagaaaaggaagaactagctcagtat cagaagtgggataaaatgagacgagccctggaatataccatttacaatcaggaacttaac gagactcgtgccaaacttgatgagctttctgctaagcgagagactagtggagaaaaatcc agacaattaagagatgctcagcaggatgcaagagataaaatggaggatatcgaacgccaa gttagagaattgaaaacaaaaatttcagctatgaaagaagaaaaagaacagcttagtgct gaaagacaagagcagattaagcagaggactaagttggagcttaaagccaaggatttacaa gatgaactagcaggcaatagtgaacaaaggaaacgtttattaaaagagaggcagaagctg cttgaaaaaatagaagaaaagcagaaagaactggcagaaacagaacccaaattcaacagt gtgaaagagaaagaagaacgaggaattgctagattggctcaagctacccaggaaagaacg gatctttatgcaaagcagggtcgaggaagccagtttacatcaaaagaagaaagggataag tggattaaaaaggaactcaagtctttagatcaggctattaatgacaagaaaagacagatt gctgctatacataaggatttggaagacactgaagcaaataaagagaaaaatctggagcag tataataaactggaccaggatcttaatgaagtcaaagctcgagtagaagaactggacaga aaatattacgaagtaaaaaataagaaagatgaactacaaactacttgtggagagaagaga atgcagaacagcaagcacttgctgctaaaagagaagatcttgaaaagaagcaacaacttc ttagagcagcaacaggaaaggttattttatcacattgttgattcagatgaagtcagcacg aagattttaatggagtttaataaaatgaatcttcctggagaggttacttttctgcctctt aacaagttagatgtcagggatacagcctatcctgaaaccaatgatgctattcctatgatc agcaaactgaggtacaatcccagatttgacaaagctttcaaacatgtgtttggaaagact cttatttgtcgtagcatggaagtttcaacccagctggcccgtgctttcactatggactgt attactttggaaggtgaccaagtcagccatcggggtgctctaactgggggttattatgac acaaggaagtctcgacttgaattgcaaaaagatgttagaaaagcagaagaagaactaggt gaacttgaagcaaagctcaatgaaaacctgcgcagaaatattgaaaatatctttttgttt tgtgcatag >gi568815588f:110398122_110610423|GENSCAN_predicted_peptide_6|409_aa MKMLKEKRQQSEKTFMPKSLEASLHAMESTRESLKAELGTDLLSQLSLEDQKRVDALNDE IRQLQQELNELRETEGGTVLTATTSELEAINKRVKDTMARSEDLDNSIDKTEAGIKELQK SMERWKNMEKEHMDAINHDTKELEKMTNRQGMLLKKKEECMKKIRELGSLPQEAFEKYQT LSLKQLFRKLEQCNTELKKYSHVNKKALDQFVNFSEQKEKLIKRQEELDRGYKSIMELMN VLELRKYEAIQLTFKQVSKNFSEVFQKLVPGGKATLVMKKGDVEGSQSQDEGEGSGKQGE MREMQQLSGGQKSLVALALIFAIQKCDPAPFYLFDEIDQALDAQHRKAVSDMIMELAVHA QFITTTFRPELLESADKFYGVKFRNKVSHIDVITAEMAKDFVEDDTTHG >gi568815588f:110398122_110610423|GENSCAN_predicted_CDS_6|1230_bp atgaagatgctaaaagagaagaggcagcagtcagagaaaaccttcatgcctaagagtttg gaggcaagcttgcatgctatggagtctaccagagagtcattgaaagcagaactgggaact gatttgctttctcaactgagtttggaagatcagaagagagtagatgcactgaatgatgag attcgtcaacttcagcaggaacttaatgagctgagagagacagaagggggtactgttctc acagccacaacatcagaacttgaagccatcaataaaagagtaaaagacactatggcacga tcagaagatttggacaattccattgataaaacagaagctggaattaaggagcttcagaag agtatggagcgctggaaaaatatggaaaaagaacatatggatgctataaatcatgatact aaagaactggaaaagatgacaaatcggcaaggcatgctattgaagaagaaagaagagtgt atgaagaaaattcgagaacttggatcacttccccaggaagcatttgaaaagtaccagaca ctgagcctcaaacagttgtttcgaaaacttgagcagtgcaacacagaattaaagaagtac agccatgttaacaaaaaggctttggatcagtttgtaaatttctccgagcagaaagaaaag ttaataaagcgtcaagaagagttagataggggttacaaatcaatcatggaactgatgaat gtacttgaacttcggaaatatgaagctattcagttaactttcaaacaggtatctaagaac ttcagtgaagtattccagaagttagtacctggtggcaaagctactttggtgatgaagaaa ggagatgtggagggcagtcagtctcaagatgaaggagaagggagtggaaaacaaggtgaa atgagagaaatgcaacagctttcaggtggacagaaatccttggtagcccttgctctgatt tttgccattcagaaatgtgacccggctccattttacttgtttgatgaaattgaccaggct ctggatgctcagcacagaaaggctgtgtcagatatgattatggaacttgctgtacatgct cagtttattacaactacttttaggcctgaactgcttgagtcagctgacaaattctatggt gtaaagttcagaaataaggttagtcatattgatgtgatcacagcagagatggccaaagac tttgtagaagatgataccacacatggttaa