GENSCAN 1.0	Date run: 3-Nov-116	Time: 04:11:16

Sequence gi568815596f:70158165_70377389 : 219225 bp : 43.86% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.07 Intr -   2623   2461  163  1  1   84  111   127 0.764  14.15
 1.06 Intr -   7028   6928  101  0  2   41   78    17 0.820  -4.17
 1.05 Intr -   7471   7364  108  2  0  109   92    27 0.944   5.46
 1.04 Intr -  11497  11393  105  2  0   60  115    56 0.941   5.69
 1.03 Intr -  17613  17509  105  1  0   70  110    27 0.892   3.29
 1.02 Intr -  21478  21368  111  2  0  102   99    13 0.916   4.15
 1.01 Init -  23821  22999  823  1  1   59   78   641 0.998  55.08
 1.00 Prom -  27574  27535   40                              -4.76

 2.00 Prom +  42863  42902   40                              -2.16
 2.01 Init +  43809  44039  231  2  0   60  110   198 0.515  15.78
 2.02 Intr +  44466  44606  141  2  0   54   76    56 0.308   1.55
 2.03 Intr +  45183  45359  177  2  0   16   32   158 0.745   3.12
 2.04 Term +  46047  46448  402  2  0   16   51   212 0.981   5.45
 2.05 PlyA +  47543  47548    6                               1.05

 3.07 PlyA -  48647  48642    6                               1.05
 3.06 Term -  60837  60817   21  0  0  114   36     2 0.017  -4.09
 3.05 Intr -  66465  66390   76  2  1   73   80    60 0.155   3.22
 3.04 Intr -  69658  69571   88  2  1   62   97    60 0.311   3.33
 3.03 Intr -  71154  71100   55  0  1   65   92     2 0.395  -3.15
 3.02 Intr -  72690  72592   99  0  0  115  121    55 0.871  11.81
 3.01 Init -  90223  90068  156  1  0   91   57    95 0.415   4.61
 3.00 Prom -  94907  94868   40                              -6.76

 4.00 Prom +  98198  98237   40                              -7.66
 4.01 Init + 100001 100112  112  1  1   95   74   243 0.998  21.97
 4.02 Intr + 101196 101402  207  1  0   94   71   189 0.981  16.85
 4.03 Intr + 103048 103222  175  2  1  126   76    89 0.988  10.50
 4.04 Intr + 116795 117006  212  2  2   85   94   323 0.999  31.16
 4.05 Intr + 117350 117502  153  0  0   69  115    73 0.771   8.14
 4.06 Term + 118570 119228  659  2  2   94   45   465 0.999  36.92
 4.07 PlyA + 119341 119346    6                               1.05

 5.05 PlyA - 121152 121147    6                               1.05
 5.04 Term - 123520 123470   51  1  0  106   36    61 0.584   0.13
 5.03 Intr - 130028 129904  125  0  2   83  103    61 0.770   7.40
 5.02 Intr - 131208 131186   23  2  2  122  103    28 0.129   4.89
 5.01 Init - 135485 135454   32  2  2   78  106     0 0.092   0.06
 5.00 Prom - 136405 136366   40                              -2.86

 6.04 PlyA - 137838 137833    6                               1.05
 6.03 Term - 139313 139125  189  2  0  117   42   222 0.999  17.95
 6.02 Intr - 142816 142676  141  1  0  131   94   181 0.996  23.55
 6.01 Init - 143847 143761   87  0  0  108   74   146 0.735  15.87
 6.00 Prom - 177339 177300   40                              -3.26

 7.03 PlyA - 177358 177353    6                               1.05
 7.02 Term - 199713 199618   96  0  0   85   41    92 0.422   2.17
 7.01 Intr - 210179 210095   85  1  1   85   99    40 0.121   4.62

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl +  62302  62469  168  0  0   81   54   148 0.879   5.46
S.002 Term + 135068 135358  291  1  0   96   48   224 0.842  14.54

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596f:70158165_70377389|GENSCAN_predicted_peptide_1|506_aa
MEPNSLRTKVPAFLSDLGKATLRGIRKCPRCGTYNGTRGLSCKNKTCGTIFRYGARKQPS
VEAVKIITGSDLQVYSVRQRDRGPDYRCFVELGVSETTIQTVDGTIITQLSSGRCYVPSC
LKAATQGVVENQCQHIKLAVNCQAEATPLTLKSSVLNAMQASPETKQTIWQLATEPTGPL
VQRITKNILVVKCKASQKHSLGYLHTSFVQKVSGKSLPERRFFCSCQTLKSHKSNASKDE
TAQRCIHFFACICAFASDETLAQEFSDFLNFDSSGLKEIIVPQLGCHSESTVSACESTAS
KSKKRRKDEVSGAQMNSSLLPQDAVSSNLRKSGLKKPVVASSLKRQACGQLLDEAQVTLS
FQDWLASVTERIHQTMHYQFDGKPEPLVFHIPQSFFDALQQRISIGSAKKRLPNSTTAFV
RKDALPLGTFSKYTWHITNILQVKQILDTPEMPLEITRSFIQNRDGTYELFKCPKVEVES
IAETYGRIEKQPVLRPLELKTFLKVX

>gi568815596f:70158165_70377389|GENSCAN_predicted_CDS_1|1518_bp
atggaaccaaattctctgaggactaaagtcccagctttcttatctgatttggggaaggcc
acattgaggggaatcagaaagtgtccccgatgtggcacatacaatggaacccggggactg
agctgtaagaacaagacatgtggaaccatattccgctacggtgcacgcaagcagcctagt
gttgaagctgtcaaaatcattacaggctctgatcttcaggtctactcagtgcggcaaaga
gaccggggccctgattaccgatgctttgtggagctcggggtttcagagacaacaatccag
acagtggatgggacgatcatcactcagctgagctctggacggtgttatgtcccctcatgc
ctgaaagctgccactcaaggcgttgtggaaaaccagtgccagcacatcaagctggcggtg
aactgccaggcagaggccacccctctgaccctgaagagctcggtcctgaatgcaatgcag
gcctccccggaaaccaaacagaccatctggcagttggccacggaacccacaggtcctctg
gtgcagagaattactaaaaacatcttggtggtgaaatgcaaggcaagccagaagcacagt
ttggggtatttgcatacatcttttgtgcagaaagtcagtggcaaaagcttgcctgagcgc
cgcttcttctgctcctgtcagactctgaaatcgcacaagtcaaatgcctccaaggatgag
acagcccagagatgcattcatttctttgcttgcatctgtgcctttgccagtgatgagaca
ctggctcaggaattctcagacttcctaaattttgattccagcggtcttaaagagattatt
gtaccccagttaggttgccattcagaatcaacagtatctgcttgtgagtctactgcctct
aagtcaaagaagaggagaaaggatgaagtatctggtgcacagatgaacagttcactactg
cctcaagatgcagtgagcagtaatctaaggaaaagtggcctgaaaaagcctgtggttgct
tcctcgttaaaaaggcaggcctgtggtcagctgttagatgaggcacaagtgactttatcc
ttccaagactggctggccagtgtcacagaacgcatccatcaaaccatgcactatcagttt
gatggcaaaccagaaccattggtgttccacattcctcagtcattttttgatgccctgcaa
caaagaatatctataggaagtgcaaaaaaacggctccccaactccaccacagcttttgtt
cggaaagatgccttgccactgggaaccttttccaagtatacttggcatatcactaatatc
ctgcaagttaaacaaatcttagataccccagagatgcccttggaaatcacccgtagcttt
atccagaaccgagatgggacttatgagctatttaaatgccctaaagtggaagtagaaagc
atagcagaaacctacggtcgtatagaaaaacaaccagtgctgcgacccttggaactaaaa
acttttctcaaagttgnn

>gi568815596f:70158165_70377389|GENSCAN_predicted_peptide_2|316_aa
MWPQHGLRYLVAAVLSRWALVETWAEVDRSPMSMEKALKHLEAYNTEKEGAFASRVGWAF
LTMLWKVHAQSLRDTAQVSRQLAQGKGDHILTEWLMAAMWTGWNDARELSKTVSKWQSYA
ELVEDTRDQRPHVELAIHWYPTNVQQVLVLVGTDADYSLVYGKRDKFLGKAAYTDGYRGQ
SVKMTVDYQELNKVTAPLHAAVLSITDLMDHLTMELGQYHYAVDLANASLSDDIAPESQE
QFGFTWEGRQWTFTVLPQGYVHSPTICHGLVATDLATWKFPKGIRLFHYIDDIMLTSDSP
ADLEAVVPLLQQHLAA

>gi568815596f:70158165_70377389|GENSCAN_predicted_CDS_2|951_bp
atgtggccccaacatgggttgcggtacttggtggcagctgtgctgtccagatgggccctg
gtggaaacatgggcagaggtagacaggtcccccatgagcatggagaaggccctgaagcac
ctggaagcatacaacaccgagaaggagggtgcctttgccagcagagttggatgggcattt
ttgactatgctatggaaagtgcatgcccagtccctgcgggatacagcacaggtgagcagg
cagttggcacaagggaaaggtgaccatatcctgactgaatggctgatggcagccatgtgg
acagggtggaatgatgccagagaattatcaaaaactgtgagtaaatggcaatcatatgca
gagctggtggaggacaccagagatcagaggccacatgtggaattggcaatccactggtac
cccaccaatgtacagcaggtgttggtgctggtaggtactgatgcagattatagcctcgtc
tatgggaaacgggataagtttttaggcaaggctgcatacacagatggttatagaggccag
tcagtgaaaatgacagtggactatcaagaactaaataaagtaacagcccctttacatgca
gcagtcctgtctatcacggatttgatggaccacctgacgatggaattgggacagtaccat
tatgcagtggatttggctaatgcatccctttcagatgatatcgctccagagagccaggaa
cagtttggcttcacatgggaagggcgacaatggactttcacagtgttgccacagggctac
gtgcatagtcccaccatatgtcatggtctcgttgctacggatttagccacctggaaattt
ccaaaggggatccgcctattccattacattgatgatattatgttaacctctgattctcct
gcagatttagaagctgtggtgcccctcttgcaacaacatttggcagcatga

>gi568815596f:70158165_70377389|GENSCAN_predicted_peptide_3|164_aa
MREAGWWSSRREGLGPAPLPAYCSPWTPRYRGFCFLSMHLDAYNQDPICEDETAGNDPYC
FVEFHEHRHAAAALAAMNGRKIMGKEVKVNWATTPSSQKKDTSNHFHVFVGDLSPEITTE
DIKAAFAPFGRISDARVVKDMATGKSKGYGFVSFFNKWSFSAFQ

>gi568815596f:70158165_70377389|GENSCAN_predicted_CDS_3|495_bp
atgagggaggcgggatggtggtcgtcccggagggaaggcctcggccctgcgccgctccca
gcctattgttctccgtggacaccgcgatatcgtggtttttgtttccttagcatgcacctg
gatgcttacaaccaagacccaatctgtgaggacgagacagctggaaatgatccctattgt
tttgtggagtttcatgagcatcgtcatgcagctgcagcattagctgctatgaatggacgg
aagataatgggtaaggaagtcaaagtgaattgggcaacaacccctagcagtcaaaagaaa
gatacaagcaatcatttccatgtctttgttggtgatctcagcccagaaattacaactgaa
gatataaaagctgcttttgcaccatttggaagaatatcagatgcccgagtggtaaaagac
atggcaacaggaaagtctaagggatatggctttgtctcctttttcaacaaatggtctttc
tccgcatttcaataa

>gi568815596f:70158165_70377389|GENSCAN_predicted_peptide_4|505_aa
MGRVVAELVSSLLGLWLLLCSCGCPEGAELRAPPDKIAIIGAGIGGTSAAYYLRQKFGKD
VKIDLFEREEVGGRLATMMVQGQEYEAGGSVIHPLNLHMKRFVKDLGLSAVQASGGLLGI
YNGETLVFEESNWFIINVIKLVWRYGFQSLRMHMWVEDVLDKFMRIYRYQSHDYAFSSVE
KLLHALGGDDFLGMLNRTLLETLQKAGFSEKFLNEMIAPVMRVNYGQSTDINAFVGAVSL
SCSDSGLWAVEGGNKLVCSGLLQASKSNLISGSVMYIEEKTKTKYTGNPTKMYEVVYQIG
TETRSDFYDIVLVATPLNRKMSNITFLNFDPPIEEFHQYYQHIVTTLVKGELNTSIFSSR
PIDKFGLNTVLTTDNSDLFINSIGIVPSVREKEDPEPSTDGTYVWKIFSQETLTKAQILK
LFLSYDYAVKKPWLAYPHYKPPEKCPSIILHDRLYYLNGIECAASAMEMSAIAAHNAALL
AYHRWNGHTDMIDQDGLYEKLKTEL

>gi568815596f:70158165_70377389|GENSCAN_predicted_CDS_4|1518_bp
atggggcgcgtcgtcgcggagctcgtctcctcgctgctggggttgtggctgttgctgtgc
agctgcggatgccccgagggcgccgagctgcgtgctccgccagataaaatcgcgattatt
ggagccggaattggtggcacttcagcagcctattacctgcggcagaaatttgggaaagat
gtgaagatagacctgtttgaaagagaagaggtcgggggccgcctggctaccatgatggtg
caggggcaagaatacgaggcaggaggttctgtcatccatcctttaaatctgcacatgaaa
cgttttgtcaaagacctgggtctctctgctgttcaggcctctggtggcctactggggata
tataatggagagactctggtatttgaggagagcaactggttcataattaacgtgattaaa
ttagtttggcgctatggatttcaatccctccgtatgcacatgtgggtagaggacgtgtta
gacaagttcatgaggatctaccgctaccagtctcatgactatgccttcagtagtgtcgaa
aaattacttcatgctctaggaggagatgacttccttggaatgcttaatcgaacacttctt
gaaaccttgcaaaaggccggcttttctgagaagttcctcaatgaaatgattgctcctgtt
atgagggtcaattatggccaaagcacggacatcaatgcctttgtgggggcggtgtcactg
tcctgttctgattctggcctttgggcagtagaaggtggcaataaacttgtttgctcaggg
cttctgcaggcatccaaaagcaatcttatatctggctcagtaatgtacatcgaggagaaa
acaaagaccaagtacacaggaaatccaacaaagatgtatgaagtggtctaccaaattgga
actgagactcgttcagacttctatgacatcgtcttggtggccactccgttgaatcgaaaa
atgtcgaatattacttttctcaactttgatcctccaattgaggaattccatcaatattat
caacatatagtgacaactttagttaagggggaattgaatacatctatctttagctctaga
cccatagataaatttggccttaatacagttttaaccactgataattcagatttgttcatt
aacagtattgggattgtgccctctgtgagagaaaaggaagatcctgagccatcaacagat
ggaacatatgtttggaagatcttttcccaagaaactcttactaaagcacaaattttaaag
ctctttctgtcctatgattatgctgtgaagaagccatggcttgcatatcctcactataag
cccccggagaaatgcccctctatcattctccatgatcgactttattacctcaatggcata
gagtgtgcagcaagtgccatggagatgagtgccattgcagcccacaacgctgcactcctt
gcctatcaccgctggaacgggcacacagacatgattgatcaggatggcttatatgagaaa
cttaaaactgaactatga

>gi568815596f:70158165_70377389|GENSCAN_predicted_peptide_5|76_aa
MSKAHPPELKKFMDKKLSLKLNGGRHVQGILRGFDPFMNLVIDECVEMATSGQQNNIGMV
VIRGNSIIMLEALERV

>gi568815596f:70158165_70377389|GENSCAN_predicted_CDS_5|231_bp
atgagcaaagctcaccctcccgagttgaaaaaatttatggacaagaagttatcattgaaa
ttaaatggtggcagacatgtccaaggaatattgcggggatttgatccctttatgaacctt
gtgatagatgaatgtgtggagatggcgactagtggacaacagaacaatattggaatggtg
gtaatacgaggaaatagtatcatcatgttagaagccttggaacgagtataa

>gi568815596f:70158165_70377389|GENSCAN_predicted_peptide_6|138_aa
MAELQQLRVQEAVESMVKSLERENIRKMQGLMFRCSASCCEDSQASMKQVHQCIERCHVP
LAQAQALVTSELEKFQDRLARCTMHCNDKAKDSIDAGSKELQVKQQLDSCVTKCVDDHMH
LIPTMTKKMKEALLSIGK

>gi568815596f:70158165_70377389|GENSCAN_predicted_CDS_6|417_bp
atggctgagctgcagcagctccgggtgcaggaggcggtggagtccatggtgaagagtctg
gaaagagagaacatccggaagatgcagggtctcatgttccggtgcagcgccagctgttgt
gaggacagccaggcctccatgaagcaggtgcaccagtgcatcgagcgctgccatgtgcct
ctggctcaagcccaggctttggtcaccagtgagctggagaagttccaggaccgcctggcc
cggtgcaccatgcattgcaacgacaaagccaaagattcaatagatgctgggagtaaggag
cttcaggtgaagcagcagctggacagttgtgtgaccaagtgtgtggatgaccacatgcac
ctcatcccaactatgaccaagaagatgaaggaggctctcttatcaattggaaaataa

>gi568815596f:70158165_70377389|GENSCAN_predicted_peptide_7|60_aa
XEKRLSTGINCDPPVNNLSDSFLTFDAGQHLEQEGQKAALVTTNELILLREAIRKRNFTP

>gi568815596f:70158165_70377389|GENSCAN_predicted_CDS_7|183_bp
nnggagaagagactcagcactggaatcaactgtgatccacctgtgaacaatttaagtgac
tccttccttacttttgatgcagggcagcatttggagcaggaaggacagaaggctgctctt
gtcaccaccaatgagctcatccttctccgagaggcaataagaaagcgaaatttcacccca
tga