GENSCAN 1.0	Date run: 2-Nov-116	Time: 18:46:38

Sequence gi568815596r:70112722_70336175 : 223454 bp : 44.67% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +  29936  30094  159  1  0   55   53   149 0.636   6.04
 1.02 PlyA +  31613  31618    6                               1.05

 2.00 Prom +  33058  33097   40                              -6.26
 2.01 Sngl +  35851  36138  288  0  0   97   48   153 0.678   6.90
 2.02 PlyA +  36710  36715    6                               1.05

 3.09 PlyA -  37192  37187    6                               1.05
 3.08 Term -  37843  37635  209  2  2   91   42   249 0.951  18.10
 3.07 Intr -  48066  47904  163  0  1   84  111   127 0.963  14.15
 3.06 Intr -  52471  52371  101  2  2   41   78    17 0.819  -4.17
 3.05 Intr -  52914  52807  108  1  0  109   92    27 0.943   5.46
 3.04 Intr -  56940  56836  105  1  0   60  115    56 0.940   5.69
 3.03 Intr -  63056  62952  105  0  0   70  110    27 0.892   3.29
 3.02 Intr -  66921  66811  111  1  0  102   99    13 0.916   4.15
 3.01 Init -  69264  68442  823  0  1   59   78   641 0.998  55.08
 3.00 Prom -  73017  72978   40                              -4.76

 4.00 Prom +  88306  88345   40                              -2.16
 4.01 Init +  89252  89482  231  1  0   60  110   198 0.515  15.78
 4.02 Intr +  89909  90049  141  1  0   54   76    56 0.308   1.55
 4.03 Intr +  90626  90802  177  1  0   16   32   158 0.745   3.12
 4.04 Term +  91490  91891  402  1  0   16   51   212 0.981   5.45
 4.05 PlyA +  92986  92991    6                               1.05

 5.07 PlyA -  94090  94085    6                               1.05
 5.06 Term - 106280 106260   21  2  0  114   36     2 0.017  -4.09
 5.05 Intr - 111908 111833   76  1  1   73   80    60 0.155   3.22
 5.04 Intr - 115101 115014   88  1  1   62   97    60 0.311   3.33
 5.03 Intr - 116597 116543   55  2  1   65   92     2 0.395  -3.15
 5.02 Intr - 118133 118035   99  2  0  115  121    55 0.871  11.81
 5.01 Init - 135666 135511  156  0  0   91   57    95 0.415   4.61
 5.00 Prom - 140350 140311   40                              -6.76

 6.00 Prom + 143641 143680   40                              -7.66
 6.01 Init + 145444 145555  112  0  1   95   74   243 0.998  21.97
 6.02 Intr + 146639 146845  207  0  0   94   71   189 0.981  16.85
 6.03 Intr + 148491 148665  175  1  1  126   76    89 0.988  10.50
 6.04 Intr + 162238 162449  212  1  2   85   94   323 0.999  31.16
 6.05 Intr + 162793 162945  153  2  0   69  115    73 0.771   8.14
 6.06 Term + 164013 164671  659  1  2   94   45   465 0.999  36.92
 6.07 PlyA + 164784 164789    6                               1.05

 7.05 PlyA - 166595 166590    6                               1.05
 7.04 Term - 168963 168913   51  0  0  106   36    61 0.584   0.13
 7.03 Intr - 175471 175347  125  2  2   83  103    61 0.770   7.40
 7.02 Intr - 176651 176629   23  1  2  122  103    28 0.129   4.89
 7.01 Init - 180928 180897   32  1  2   78  106     0 0.092   0.06
 7.00 Prom - 181848 181809   40                              -2.86

 8.04 PlyA - 183281 183276    6                               1.05
 8.03 Term - 184756 184568  189  1  0  117   42   222 0.999  17.95
 8.02 Intr - 188259 188119  141  0  0  131   94   181 0.996  23.55
 8.01 Init - 189290 189204   87  2  0  108   74   146 0.735  15.87
 8.00 Prom - 222782 222743   40                              -3.26

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl + 107745 107912  168  2  0   81   54   148 0.879   5.46
S.002 Term + 180511 180801  291  0  0   96   48   224 0.842  14.54

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:70112722_70336175|GENSCAN_predicted_peptide_1|52_aa
RATYLAPPLRATQGTPVTSPPSGALIPALSRKDAKVVTCPVRQGGCAGGIGN

>gi568815596r:70112722_70336175|GENSCAN_predicted_CDS_1|159_bp
cgcgccacgtacctggccccgcccctgcgagccacgcagggaaccccggtgacgtcacca
ccctccggcgctctcattcccgcgctctccagaaaagacgcgaaggtggtgacgtgtccc
gtgcgccagggcggctgcgcaggaggcattggcaactga

>gi568815596r:70112722_70336175|GENSCAN_predicted_peptide_2|95_aa
MPEPSTHSMGSCAARASLTSTTPCSTVPSPIDHPRAEECERMAQDWQAAPPAAPVWDPLG
EASWAPESGGECGVFISSSGIVNTPISTLCLAQGL

>gi568815596r:70112722_70336175|GENSCAN_predicted_CDS_2|288_bp
atgcctgagccttccacccactccatgggctcctgtgcggcccgagcctccctgacgagc
accaccccctgctccacagtgcccagtcccatcgaccacccaagggctgaggaatgtgag
cgcatggcgcaggactggcaggcagctccacctgcagccccagtgtgggatccactgggt
gaagccagctgggctcctgagtctgggggggaatgtggagtctttatatctagctcaggg
attgtaaatacaccaatcagcaccctgtgtttagctcaaggtttgtga

>gi568815596r:70112722_70336175|GENSCAN_predicted_peptide_3|574_aa
MEPNSLRTKVPAFLSDLGKATLRGIRKCPRCGTYNGTRGLSCKNKTCGTIFRYGARKQPS
VEAVKIITGSDLQVYSVRQRDRGPDYRCFVELGVSETTIQTVDGTIITQLSSGRCYVPSC
LKAATQGVVENQCQHIKLAVNCQAEATPLTLKSSVLNAMQASPETKQTIWQLATEPTGPL
VQRITKNILVVKCKASQKHSLGYLHTSFVQKVSGKSLPERRFFCSCQTLKSHKSNASKDE
TAQRCIHFFACICAFASDETLAQEFSDFLNFDSSGLKEIIVPQLGCHSESTVSACESTAS
KSKKRRKDEVSGAQMNSSLLPQDAVSSNLRKSGLKKPVVASSLKRQACGQLLDEAQVTLS
FQDWLASVTERIHQTMHYQFDGKPEPLVFHIPQSFFDALQQRISIGSAKKRLPNSTTAFV
RKDALPLGTFSKYTWHITNILQVKQILDTPEMPLEITRSFIQNRDGTYELFKCPKVEVES
IAETYGRIEKQPVLRPLELKTFLKVGNTSPDQKEPTPFIIEWIPDILPQSKIGELRIKFE
YGHHRNGHVAEYQDQRPPLDQPLELAPLTTITFP

>gi568815596r:70112722_70336175|GENSCAN_predicted_CDS_3|1725_bp
atggaaccaaattctctgaggactaaagtcccagctttcttatctgatttggggaaggcc
acattgaggggaatcagaaagtgtccccgatgtggcacatacaatggaacccggggactg
agctgtaagaacaagacatgtggaaccatattccgctacggtgcacgcaagcagcctagt
gttgaagctgtcaaaatcattacaggctctgatcttcaggtctactcagtgcggcaaaga
gaccggggccctgattaccgatgctttgtggagctcggggtttcagagacaacaatccag
acagtggatgggacgatcatcactcagctgagctctggacggtgttatgtcccctcatgc
ctgaaagctgccactcaaggcgttgtggaaaaccagtgccagcacatcaagctggcggtg
aactgccaggcagaggccacccctctgaccctgaagagctcggtcctgaatgcaatgcag
gcctccccggaaaccaaacagaccatctggcagttggccacggaacccacaggtcctctg
gtgcagagaattactaaaaacatcttggtggtgaaatgcaaggcaagccagaagcacagt
ttggggtatttgcatacatcttttgtgcagaaagtcagtggcaaaagcttgcctgagcgc
cgcttcttctgctcctgtcagactctgaaatcgcacaagtcaaatgcctccaaggatgag
acagcccagagatgcattcatttctttgcttgcatctgtgcctttgccagtgatgagaca
ctggctcaggaattctcagacttcctaaattttgattccagcggtcttaaagagattatt
gtaccccagttaggttgccattcagaatcaacagtatctgcttgtgagtctactgcctct
aagtcaaagaagaggagaaaggatgaagtatctggtgcacagatgaacagttcactactg
cctcaagatgcagtgagcagtaatctaaggaaaagtggcctgaaaaagcctgtggttgct
tcctcgttaaaaaggcaggcctgtggtcagctgttagatgaggcacaagtgactttatcc
ttccaagactggctggccagtgtcacagaacgcatccatcaaaccatgcactatcagttt
gatggcaaaccagaaccattggtgttccacattcctcagtcattttttgatgccctgcaa
caaagaatatctataggaagtgcaaaaaaacggctccccaactccaccacagcttttgtt
cggaaagatgccttgccactgggaaccttttccaagtatacttggcatatcactaatatc
ctgcaagttaaacaaatcttagataccccagagatgcccttggaaatcacccgtagcttt
atccagaaccgagatgggacttatgagctatttaaatgccctaaagtggaagtagaaagc
atagcagaaacctacggtcgtatagaaaaacaaccagtgctgcgacccttggaactaaaa
acttttctcaaagttggcaacacttccccagatcaaaaggagccaacacctttcatcatc
gagtggatcccagatatccttccccaatctaagattggcgagctgcggatcaagtttgag
tatggccaccaccggaatgggcatgtggcggagtaccaagaccagcggccccccttggac
cagcccttggaactggcccctctgaccactattactttcccttaa

>gi568815596r:70112722_70336175|GENSCAN_predicted_peptide_4|316_aa
MWPQHGLRYLVAAVLSRWALVETWAEVDRSPMSMEKALKHLEAYNTEKEGAFASRVGWAF
LTMLWKVHAQSLRDTAQVSRQLAQGKGDHILTEWLMAAMWTGWNDARELSKTVSKWQSYA
ELVEDTRDQRPHVELAIHWYPTNVQQVLVLVGTDADYSLVYGKRDKFLGKAAYTDGYRGQ
SVKMTVDYQELNKVTAPLHAAVLSITDLMDHLTMELGQYHYAVDLANASLSDDIAPESQE
QFGFTWEGRQWTFTVLPQGYVHSPTICHGLVATDLATWKFPKGIRLFHYIDDIMLTSDSP
ADLEAVVPLLQQHLAA

>gi568815596r:70112722_70336175|GENSCAN_predicted_CDS_4|951_bp
atgtggccccaacatgggttgcggtacttggtggcagctgtgctgtccagatgggccctg
gtggaaacatgggcagaggtagacaggtcccccatgagcatggagaaggccctgaagcac
ctggaagcatacaacaccgagaaggagggtgcctttgccagcagagttggatgggcattt
ttgactatgctatggaaagtgcatgcccagtccctgcgggatacagcacaggtgagcagg
cagttggcacaagggaaaggtgaccatatcctgactgaatggctgatggcagccatgtgg
acagggtggaatgatgccagagaattatcaaaaactgtgagtaaatggcaatcatatgca
gagctggtggaggacaccagagatcagaggccacatgtggaattggcaatccactggtac
cccaccaatgtacagcaggtgttggtgctggtaggtactgatgcagattatagcctcgtc
tatgggaaacgggataagtttttaggcaaggctgcatacacagatggttatagaggccag
tcagtgaaaatgacagtggactatcaagaactaaataaagtaacagcccctttacatgca
gcagtcctgtctatcacggatttgatggaccacctgacgatggaattgggacagtaccat
tatgcagtggatttggctaatgcatccctttcagatgatatcgctccagagagccaggaa
cagtttggcttcacatgggaagggcgacaatggactttcacagtgttgccacagggctac
gtgcatagtcccaccatatgtcatggtctcgttgctacggatttagccacctggaaattt
ccaaaggggatccgcctattccattacattgatgatattatgttaacctctgattctcct
gcagatttagaagctgtggtgcccctcttgcaacaacatttggcagcatga

>gi568815596r:70112722_70336175|GENSCAN_predicted_peptide_5|164_aa
MREAGWWSSRREGLGPAPLPAYCSPWTPRYRGFCFLSMHLDAYNQDPICEDETAGNDPYC
FVEFHEHRHAAAALAAMNGRKIMGKEVKVNWATTPSSQKKDTSNHFHVFVGDLSPEITTE
DIKAAFAPFGRISDARVVKDMATGKSKGYGFVSFFNKWSFSAFQ

>gi568815596r:70112722_70336175|GENSCAN_predicted_CDS_5|495_bp
atgagggaggcgggatggtggtcgtcccggagggaaggcctcggccctgcgccgctccca
gcctattgttctccgtggacaccgcgatatcgtggtttttgtttccttagcatgcacctg
gatgcttacaaccaagacccaatctgtgaggacgagacagctggaaatgatccctattgt
tttgtggagtttcatgagcatcgtcatgcagctgcagcattagctgctatgaatggacgg
aagataatgggtaaggaagtcaaagtgaattgggcaacaacccctagcagtcaaaagaaa
gatacaagcaatcatttccatgtctttgttggtgatctcagcccagaaattacaactgaa
gatataaaagctgcttttgcaccatttggaagaatatcagatgcccgagtggtaaaagac
atggcaacaggaaagtctaagggatatggctttgtctcctttttcaacaaatggtctttc
tccgcatttcaataa

>gi568815596r:70112722_70336175|GENSCAN_predicted_peptide_6|505_aa
MGRVVAELVSSLLGLWLLLCSCGCPEGAELRAPPDKIAIIGAGIGGTSAAYYLRQKFGKD
VKIDLFEREEVGGRLATMMVQGQEYEAGGSVIHPLNLHMKRFVKDLGLSAVQASGGLLGI
YNGETLVFEESNWFIINVIKLVWRYGFQSLRMHMWVEDVLDKFMRIYRYQSHDYAFSSVE
KLLHALGGDDFLGMLNRTLLETLQKAGFSEKFLNEMIAPVMRVNYGQSTDINAFVGAVSL
SCSDSGLWAVEGGNKLVCSGLLQASKSNLISGSVMYIEEKTKTKYTGNPTKMYEVVYQIG
TETRSDFYDIVLVATPLNRKMSNITFLNFDPPIEEFHQYYQHIVTTLVKGELNTSIFSSR
PIDKFGLNTVLTTDNSDLFINSIGIVPSVREKEDPEPSTDGTYVWKIFSQETLTKAQILK
LFLSYDYAVKKPWLAYPHYKPPEKCPSIILHDRLYYLNGIECAASAMEMSAIAAHNAALL
AYHRWNGHTDMIDQDGLYEKLKTEL

>gi568815596r:70112722_70336175|GENSCAN_predicted_CDS_6|1518_bp
atggggcgcgtcgtcgcggagctcgtctcctcgctgctggggttgtggctgttgctgtgc
agctgcggatgccccgagggcgccgagctgcgtgctccgccagataaaatcgcgattatt
ggagccggaattggtggcacttcagcagcctattacctgcggcagaaatttgggaaagat
gtgaagatagacctgtttgaaagagaagaggtcgggggccgcctggctaccatgatggtg
caggggcaagaatacgaggcaggaggttctgtcatccatcctttaaatctgcacatgaaa
cgttttgtcaaagacctgggtctctctgctgttcaggcctctggtggcctactggggata
tataatggagagactctggtatttgaggagagcaactggttcataattaacgtgattaaa
ttagtttggcgctatggatttcaatccctccgtatgcacatgtgggtagaggacgtgtta
gacaagttcatgaggatctaccgctaccagtctcatgactatgccttcagtagtgtcgaa
aaattacttcatgctctaggaggagatgacttccttggaatgcttaatcgaacacttctt
gaaaccttgcaaaaggccggcttttctgagaagttcctcaatgaaatgattgctcctgtt
atgagggtcaattatggccaaagcacggacatcaatgcctttgtgggggcggtgtcactg
tcctgttctgattctggcctttgggcagtagaaggtggcaataaacttgtttgctcaggg
cttctgcaggcatccaaaagcaatcttatatctggctcagtaatgtacatcgaggagaaa
acaaagaccaagtacacaggaaatccaacaaagatgtatgaagtggtctaccaaattgga
actgagactcgttcagacttctatgacatcgtcttggtggccactccgttgaatcgaaaa
atgtcgaatattacttttctcaactttgatcctccaattgaggaattccatcaatattat
caacatatagtgacaactttagttaagggggaattgaatacatctatctttagctctaga
cccatagataaatttggccttaatacagttttaaccactgataattcagatttgttcatt
aacagtattgggattgtgccctctgtgagagaaaaggaagatcctgagccatcaacagat
ggaacatatgtttggaagatcttttcccaagaaactcttactaaagcacaaattttaaag
ctctttctgtcctatgattatgctgtgaagaagccatggcttgcatatcctcactataag
cccccggagaaatgcccctctatcattctccatgatcgactttattacctcaatggcata
gagtgtgcagcaagtgccatggagatgagtgccattgcagcccacaacgctgcactcctt
gcctatcaccgctggaacgggcacacagacatgattgatcaggatggcttatatgagaaa
cttaaaactgaactatga

>gi568815596r:70112722_70336175|GENSCAN_predicted_peptide_7|76_aa
MSKAHPPELKKFMDKKLSLKLNGGRHVQGILRGFDPFMNLVIDECVEMATSGQQNNIGMV
VIRGNSIIMLEALERV

>gi568815596r:70112722_70336175|GENSCAN_predicted_CDS_7|231_bp
atgagcaaagctcaccctcccgagttgaaaaaatttatggacaagaagttatcattgaaa
ttaaatggtggcagacatgtccaaggaatattgcggggatttgatccctttatgaacctt
gtgatagatgaatgtgtggagatggcgactagtggacaacagaacaatattggaatggtg
gtaatacgaggaaatagtatcatcatgttagaagccttggaacgagtataa

>gi568815596r:70112722_70336175|GENSCAN_predicted_peptide_8|138_aa
MAELQQLRVQEAVESMVKSLERENIRKMQGLMFRCSASCCEDSQASMKQVHQCIERCHVP
LAQAQALVTSELEKFQDRLARCTMHCNDKAKDSIDAGSKELQVKQQLDSCVTKCVDDHMH
LIPTMTKKMKEALLSIGK

>gi568815596r:70112722_70336175|GENSCAN_predicted_CDS_8|417_bp
atggctgagctgcagcagctccgggtgcaggaggcggtggagtccatggtgaagagtctg
gaaagagagaacatccggaagatgcagggtctcatgttccggtgcagcgccagctgttgt
gaggacagccaggcctccatgaagcaggtgcaccagtgcatcgagcgctgccatgtgcct
ctggctcaagcccaggctttggtcaccagtgagctggagaagttccaggaccgcctggcc
cggtgcaccatgcattgcaacgacaaagccaaagattcaatagatgctgggagtaaggag
cttcaggtgaagcagcagctggacagttgtgtgaccaagtgtgtggatgaccacatgcac
ctcatcccaactatgaccaagaagatgaaggaggctctcttatcaattggaaaataa