GENSCAN 1.0    Date run: 4-Nov-116    Time: 00:45:50

Sequence gi568815596r:144289454_144617350 : 327897 bp : 38.55% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------
 1.05 PlyA -    366    361    6                               1.05
 1.04 Term -  17163  17128   36  0  0   81   54    51 0.358  -2.64
 1.03 Intr -  26471  26369  103  1  1   75   98   117 0.966  10.66
 1.02 Intr -  28135  28026  110  1  2   67  116    13 0.970   0.16
 1.01 Init -  31151  31110   42  2  0  102  105    66 0.985  10.17
 1.00 Prom -  32333  32294   40                              -9.65

 2.00 Prom +  34760  34799   40                              -4.65
 2.01 Init +  38187  38318  132  2  0  112   97    53 0.823   8.79
 2.02 Term +  43541  43747  207  1  0   54   44   187 0.835   7.26
 2.03 PlyA +  44288  44293    6                               1.05

 3.05 PlyA -  45840  45835    6                               1.05
 3.04 Term -  76151  75981  171  2  0   46   48   150 0.011   3.64
 3.03 Intr -  87453  87323  131  1  2   58   73    61 0.064   1.09
 3.02 Intr -  91577  91430  148  2  1   96   36   133 0.159   7.79
 3.01 Init -  92371  92255  117  1  0   15   72   133 0.982   4.65
 3.00 Prom -  95676  95637   40                              -6.85

 4.10 PlyA -  96050  96045    6                               1.05
 4.09 Term - 100575  99998  578  1  2  118   38   781 0.999  69.74
 4.08 Intr - 107139 106959  181  0  1   68   80   139 0.942   9.62
 4.07 Intr - 110892 108848 2045  1  2   78  103  1137 0.615 100.57
 4.06 Intr - 111854 111746  109  2  1   87   89    95 0.999   8.44
 4.05 Intr - 114677 114463  215  0  2   99   75   269 0.985  24.21
 4.04 Intr - 115571 115383  189  0  0   64   77   263 0.881  21.44
 4.03 Intr - 135414 135343   72  1  0   77  109    79 0.699   7.36
 4.02 Intr - 140573 140316  258  0  0   51  111   271 0.687  22.21
 4.01 Init - 148673 148610   64  2  1   81   86    35 0.317   3.96
 4.00 Prom - 176703 176664   40                              -5.55

 5.04 PlyA - 177605 177600    6                               1.05
 5.03 Term - 184660 184572   89  2  2  102   42   120 0.762   5.64
 5.02 Intr - 191264 191240   25  2  1   80   96    13 0.481  -2.02
 5.01 Init - 195539 195486   54  2  0   47   89   103 0.914   7.63
 5.00 Prom - 199579 199540   40                              -7.65

 6.00 Prom + 200872 200911   40                              -6.05
 6.01 Sngl + 201521 201691  171  1  0   75   45   203 0.975   9.35
 6.02 PlyA + 202220 202225    6                               1.05

 7.07 PlyA - 205709 205704    6                               1.05
 7.06 Term - 209099 208977  123  2  0   87   47    89 0.434   2.10
 7.05 Intr - 215180 215045  136  1  1  129   42    41 0.214   3.25
 7.04 Intr - 219973 219808  166  2  1   99   50    93 0.277   4.70
 7.03 Intr - 227966 227825  142  2  1   60  107   122 0.108  10.31
 7.02 Intr - 237683 237564  120  2  0   94   78    80 0.471   7.37
 7.01 Init - 238687 238544  144  1  0   75   93     9 0.358   0.27
 7.00 Prom - 241608 241569   40                              -3.65

 8.00 Prom + 242409 242448   40                              -3.35
 8.01 Init + 250067 250134   68  1  2   70   36    66 0.018   0.20
 8.02 Intr + 279547 279678  132  1  0    2   58   143 0.017   1.54
 8.03 Term + 291660 291777  118  0  1  101   43   163 0.412  10.13
 8.04 PlyA + 292675 292680    6                               1.05

 9.03 PlyA - 292713 292708    6                               1.05
 9.02 Term - 293649 293497  153  0  0   48   42   129 0.240   1.24
 9.01 Init - 296270 296187   84  2  0   44   85    94 0.645   5.57

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------
S.001 Term + 233459 233684  226  2  1   82   41   185 0.900   8.47
S.002 Term + 271749 271918  170  1  2   29   39   200 0.902   6.26

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815596r:144289454_144617350|GENSCAN_predicted_peptide_1|96_aa
MEQAKVGELEFDLKCFLQRTHLVVIFIGYEEGSDPGIRGLQKTFWRKINVRGSEVKHSQG
DSVMAELKCEYCGTLVWDVVMKTGQVDVLTHNGIQN

>gi568815596r:144289454_144617350|GENSCAN_predicted_CDS_1|291_bp
atggagcaggccaaagttggagaactggaatttgacctgaagtgttttttacaaaggacc
catttagtcgtgatatttataggatatgaggaaggcagtgatcctggcattcgtggactc
cagaaaacattttggagaaaaattaatgtaagggggagtgaagtaaagcacagccaaggt
gatagtgtcatggcagagcttaaatgtgagtactgtggcaccttggtgtgggatgtggta
atgaagacaggacaggttgacgtgctaactcacaatggtattcagaactga

>gi568815596r:144289454_144617350|GENSCAN_predicted_peptide_2|112_aa
MEPRSQYLKGQSGDVSPENSITGSWLCKTPTTYLWIRIGPSPLKSLRAPVRVPGSLADTS
TCSAARSSSTAALPMPLNIPPSGPPRFPPGKAHAFLYLHFSRPFGSDYSTHF

>gi568815596r:144289454_144617350|GENSCAN_predicted_CDS_2|339_bp
atggagcctagatctcagtatctaaagggtcaatctggtgatgtgagtccagaaaacagt
atcacaggatcttggctctgcaagaccccaaccacctacctatggataagaataggtccc
agccccctaaagagcctcagggcccctgtgcgggttcctggctcccttgccgacacctct
acctgctctgcagcccggagctcctccacagctgctttgccaatgcctcttaatatccct
ccttctggacctcctcgattccctcctggcaaggctcatgctttcctttatctccatttt
tcccgtcctttcggttctgactactccacacacttctga

>gi568815596r:144289454_144617350|GENSCAN_predicted_peptide_3|188_aa
MILTWSEEETRQQVYTVRGAVGIHKSAMGKLEQTETDQKVKEGDVIVNEMEVADSQLGVG
ADPQKPLRAVVIDLSLKVVKRSAMGRLCSTSSLTFTKWTWQYPSPFPLQNLVNVTHQVTQ
VKVMRVILLDLYRRRAKCLVLERNMIIFAFRIPVVLSPVEDSTGQLIVCKEWDVIQQNFR
ERKRCRVS

>gi568815596r:144289454_144617350|GENSCAN_predicted_CDS_3|567_bp
atgatcctgacgtggtctgaggaggaaacaaggcagcaggtgtacacagtaagaggagca
gtaggaattcataaatctgcaatgggaaaattggaacagacagaaacagaccaaaaggta
aaagaaggtgatgtgattgtaaatgagatggaagtagcggacagtcagctgggggtgggt
gcagatccacaaaagcccttgagggcagtggtgattgacttgagcctgaaggttgtaaag
aggtcagccatgggaagattgtgcagcacctcaagcctgacctttacaaaatggacatgg
cagtacccttctcccttccccctacaaaatctggtgaatgtcacccaccaggtcactcag
gtgaaagtcatgagagtcatcctgctggatctctatagaagaagagctaagtgcttggtt
ctggagcggaacatgatcatttttgcctttcggattcctgtggtattgtcccctgtggag
gattcaacaggtcagctaattgtctgtaaggaatgggatgttattcagcagaatttcagg
gagaggaagagatgcagagtctcataa

>gi568815596r:144289454_144617350|GENSCAN_predicted_peptide_4|1236_aa
MECLLQEKYFFKRQAVATSDEVVNYDNVVDTGSETDEEDKLHIAEDDGIANPLDQETSPA
SVPNHESSPHVSQALLPREEEEDEIREGGVEHPWHNNEILQASVDGPEEMKEDYDTMGPE
ATIQTAINNGTVKNANCTSDFEEYFAKRKLEERDGHAVSIEEYLQRSDTAIIYPEAPEEL
SRLGTPEANGQEENDLPPGTPDAFAQLLTCPYCDRGYKRLTSLKEHIKYRHEKNEENFSC
PLCSYTFAYRTQLERHMVTHKPGTDQHQMLTQGAGNRKFKCTECGKAFKYKHHLKEHLRI
HSDSSLGRQLQLLSNQGNVTVILIIILGEKPYECPNCKKRFSHSGSYSSHISSKKCIGLI
SVNGRMRNNIKTGSSPNSVSSSPTNSAITQLRNKLENGKPLSMSEQTGLLKIKTEPLDFN
DYKVLMATHGFSGTSPFMNGGLGATSPLGVHPSAQSPMQHLGVGMEAPLLGFPTMNSNLS
EVQKVLQIVDNTVSRQKMDCKAEEISKLKGYHMKDPCSQPEEQGVTSPNIPPVGLPVVSH
NGATKSIIDYTLEKVNEAKACLQSLTTDSRRQISNIKKEKLRTLIDLVTDDKMIENHNIS
TPFSCQFCKESFPGPIPLHQHERYLCKMNEEIKAVLQPHENIVPNKAGVFVDNKALLLSS
VLSEKGMTSPINPYKDHMSVLKAYYAMNMEPNSDELLKISIAVGLPQEFVKEWFEQRKVY
QYSNSRSPSLERSSKPLAPNSNPPTKDSLLPRSPVKPMDSITSPSIAELHNSVTNCDPPL
RLTKPSHFTNIKPVEKLDHSRSNTPSPLNLSSTSSKNSHSSSYTPNSFSSEELQAEPLDL
SLPKQMKEPKSIIATKNKTKASSISLDHNSVSSSSENSDEPLNLTFIKKEFSNSNNLDNK
STNPVFSMNPFSAKPLYTALPPQSAFPPATFMPPVQTSIPGLRPYPGLDQMSFLPHMAYT
YPTGAATFADMQQRRKYQRKQGFQGELLDGAQDYMSGLDDMTDSDSCLSRKKIKKTESGM
YACDLCDKTFQKSSSLLRHKYEHTGKRPHQCQICKKAFKHKHHLIEHSRLHSGEKPYQCD
KCGKRFSHSGSYSQHMNHRYSYCKREAEEREAAEREAREKGHLEPTELLMNRAYLQSITP
QGYSDSEERESMPRDGESEKEHEKEGEDGYGKLGRQDGDEEFEEEEEESENKSMDTDPET
IRDEEETGDHSMDDSSEDGKMETKSDHEEDNMEDGM

>gi568815596r:144289454_144617350|GENSCAN_predicted_CDS_4|3711_bp
atggaatgccttttgcaagaaaaatatttctttaagagacaagctgttgctacttcagat
gaagtggtgaactatgacaatgtagtggacacaggttctgaaacagatgaggaagacaag
cttcatattgctgaggatgacggtattgccaaccctctggaccaggagacgagtccagct
agtgtgcccaaccatgagtcctccccacacgtgagccaagctctgttgccaagagaggaa
gaggaagatgaaataagggagggtggagtggaacacccctggcacaacaacgagattcta
caagcctctgtagatggtccagaagaaatgaaggaagactatgacactatggggccagaa
gccacgatccagaccgcaattaacaatggtacagtgaagaatgcaaattgcacatcagat
tttgaggaatactttgccaaaagaaaactggaggaacgcgatggtcatgcagtcagcatc
gaggagtaccttcagcgcagtgacacagccattatttacccagaagcccctgaggagctg
tctcgccttggcacgccagaggccaatgggcaagaagaaaatgacctgccacctggaact
ccagatgcttttgcccaactgctgacctgcccctactgcgaccggggctacaagcgcttg
acatcactgaaggagcacatcaagtaccgccacgagaagaatgaagagaacttttcctgc
cctctctgtagctacacgtttgcctaccgcacccagctcgagcggcatatggtgacacac
aagccagggacagatcagcaccaaatgctaacccaaggagcaggtaatcgcaagttcaaa
tgcacagagtgtggcaaggccttcaaatataaacaccatctgaaagaacacctgcgaatt
cacagtgattcttctcttggtagacaattgcaattgttatctaatcaaggaaatgtaacg
gtaattttaattatcattttaggtgaaaaaccttacgagtgcccaaactgcaagaaacgt
ttctcccattctggttcctacagttcgcacatcagcagcaagaaatgtattggtttaatc
tctgtaaatggccgaatgagaaacaatatcaagacgggttcttcccctaattctgtttct
tcttctcctactaattcagccattacccagttaagaaacaagttggagaatggaaaacca
cttagtatgtctgaacagacaggcttacttaaaattaaaacagaaccactagacttcaat
gactataaagttcttatggctacacacgggtttagtggcactagtccctttatgaatggt
gggcttggagccaccagccctttaggagttcatccatctgctcagagtccaatgcagcac
ttaggtgtagggatggaagcccctttacttgggtttcccaccatgaatagtaatttaagt
gaggtacaaaaggttctacagattgtggacaatactgtttccaggcaaaaaatggactgc
aaggctgaagaaatttcaaagttgaaaggttatcacatgaaggatccatgctctcaacct
gaggaacaaggagttacttctcctaatattccgcctgtcggtcttccggtagtgagtcat
aatggtgccactaaaagtattattgactatacgttggaaaaagtcaatgaagccaaagct
tgcctccagagcttgactactgactcaaggagacagatcagtaatataaagaaagagaag
ctacgtactttaatagatttggtcactgatgacaaaatgattgagaaccacaacatatcc
actccattttcatgccagttctgtaaagaaagttttcctggccccatccctttgcatcag
catgaacgttacctttgtaagatgaatgaagagatcaaggcggtcctgcagcctcatgaa
aacatagtccccaacaaagccggagtttttgttgataataaagccctcctcttgtcatct
gtactttctgagaaaggaatgacaagccccatcaacccatacaaggaccacatgtctgta
ctcaaagcatactatgctatgaacatggagcccaactccgatgaactgctgaaaatttcc
attgctgtgggccttcctcaggaatttgtgaaggaatggtttgaacaacgaaaagtctac
cagtactcaaattccaggtccccatccctggaaagaagctccaagccgttagctcccaac
agtaaccctcccacaaaagactctttattacccaggtctcctgtaaaacctatggactcc
ataacatcaccatctatagcagaactccacaacagtgttacgaattgtgatcctcctctc
aggctaacaaaaccttcccattttaccaatattaaaccagttgaaaaattggaccactcc
aggagtaatactccttctcccttaaatctttcctccacatcttctaaaaactcccacagt
agttcatacactccaaacagcttctcttctgaggagctccaggctgagcctttagacttg
tcattaccaaaacaaatgaaagaacccaaaagtattatagccacaaagaacaaaacaaaa
gctagtagcatcagtttagatcataacagtgtttcttcctcatctgaaaactcagatgag
cctctgaacttgacttttatcaagaaggaattttcaaattcaaataatctggacaacaaa
agcactaacccagtgttcagcatgaacccatttagtgccaaacctttatacacagctctt
ccacctcaaagcgcatttccccctgctactttcatgccaccagtccagaccagtattcct
gggctacgaccatacccaggactggatcagatgagcttcctaccacatatggcctacacc
tacccaactggagcagctacttttgctgatatgcagcaaaggagaaagtaccagcggaaa
caaggatttcagggagaattgcttgatggagcacaagactacatgtcaggcctagatgat
atgacagactccgactcctgtctgtctcgcaaaaagatcaagaagacagagagtggcatg
tatgcatgtgacttatgtgacaagacattccagaaaagcagttcccttctgcgacataaa
tacgaacacacaggaaaaagaccacatcagtgtcagatttgtaagaaagcgtttaaacac
aagcaccaccttatcgagcactcaaggcttcactcgggcgagaagccctatcagtgtgat
aaatgtggcaagcgcttctcacactcgggctcgtactcgcagcacatgaatcacaggtat
tcctactgcaagcgggaggcggaggagcgggaagcggcggagcgcgaggcgcgcgagaaa
gggcacttggaacccaccgagctgctgatgaaccgggcttacttgcagagcattacccct
caggggtactctgactcggaggagagggagagtatgccgagggatggcgagagcgagaag
gagcacgagaaagaaggcgaggatggctacgggaagctgggcagacaggatggcgacgag
gagttcgaggaggaagaggaagaaagtgaaaataaaagtatggatacggatcccgaaacg
atacgagatgaagaagagactggagatcactccatggacgatagttcggaggatgggaaa
atggaaaccaaatcagaccacgaggaagacaatatggaagatggcatgtaa

>gi568815596r:144289454_144617350|GENSCAN_predicted_peptide_5|55_aa
MYAKSEGTQDQKIQYARVYREKIEMTVLSSGGALLRTARTRPINEPVYLQTMTQD

>gi568815596r:144289454_144617350|GENSCAN_predicted_CDS_5|168_bp
atgtatgccaagagtgaagggacccaggaccagaaaattcagtatgcccgggtgtataga
gagaaaatagagatgacagtgctgtcttcaggaggggcgctgctcaggactgcccggaca
aggccaattaatgagcctgtgtatttgcaaacgatgacccaggattag

>gi568815596r:144289454_144617350|GENSCAN_predicted_peptide_6|56_aa
MAAKEPLCSAVSVIRALRTPRVRIDTIDRNSSEEAVMVYHASDVRILAGPLALLWK

>gi568815596r:144289454_144617350|GENSCAN_predicted_CDS_6|171_bp
atggctgctaaggagccgttatgtagtgccgtgtcagtgatccgtgcgttgaggactccc
cgagtacgcattgacacgatagaccggaacagctcagaagaagctgttatggtctatcat
gccagtgatgtgcggatattggcagggcctctggcccttctgtggaaatga

>gi568815596r:144289454_144617350|GENSCAN_predicted_peptide_7|276_aa
MSLSVRSKRQNKPYMEKQGPITASRRPGNGSQCAERVGTSGSETHLGKVLEISANPRTAT
IPKGNAQFIMNGLKANSALTAKFLVFKFIEPACCRSRAPSPCELPSDPLLSMKQPIMADG
PRCKRRKQANPRRKNEDLQNLQKEIIPTPSTPPIHPHDPSNEGFRQIYFSKHLWTLAGKV
GVCLQKEAFVKGYLFPMVQNRLDITVHQFQGFPWRYLHQISPVKLQVEIGRPETHRLLPP
TNCDDDDFIFQQSVHFQRGWVQTSSKNGFLDFLQDL

>gi568815596r:144289454_144617350|GENSCAN_predicted_CDS_7|831_bp
atgagcctaagtgttaggtccaaaaggcaaaataagccatacatggaaaaacaagggcca
atcacagcatccaggaggcctggaaatgggtcacaatgtgcagagcgggtgggcacaagt
gggtcagaaacacatctgggaaaggtacttgaaattagcgcaaaccccaggacagctaca
attccaaaagggaatgctcagttcattatgaatggtcttaaagcaaacagtgctctgaca
gccaaattcctggtcttcaaatttatcgagcctgcgtgctgccgaagcagggcgccgagt
ccatgcgaactgccatctgatccgctcttatcaatgaagcagccgatcatggcggatggc
ccccggtgcaagaggcgcaaacaagccaatcccaggaggaaaaacgaagatttacagaat
ttacaaaaagagattatccctacaccttccacccctcctatccacccccatgacccaagc
aatgaaggatttaggcagatttacttcagtaaacacctttggacacttgcggggaaagtg
ggagtctgtttgcaaaaggaggcttttgttaaaggttaccttttcccgatggtgcaaaat
cgacttgacataactgttcatcagtttcagggttttccctggaggtatttgcatcaaatc
agcccagtcaagcttcaagttgaaattggcaggccagaaactcacaggctcttgcctccc
actaactgtgatgatgacgatttcatattccagcagtcagttcactttcagcgagggtgg
gtgcagacttcatcaaagaatggcttcctggatttccttcaggacctgtga

>gi568815596r:144289454_144617350|GENSCAN_predicted_peptide_8|105_aa
METTDSFVDSNLRPNFNNTAVHMAFAERQHRSQDEFVFIDKSEEYLWPSKARGSNSEEVS
GHRCSPSLYLANERQPYSEPDVRLLENEHKLVYFKPNGKFYMQGP

>gi568815596r:144289454_144617350|GENSCAN_predicted_CDS_8|318_bp
atggagacaacagactctttcgttgactcaaatttacggccaaattttaataacacagct
gtccatatggcatttgcagagaggcagcataggtctcaagacgagtttgtctttatcgat
aagagtgaagagtatttgtggccttcaaaagccagaggctccaactctgaggaggtatct
ggtcatcgctgttccccgagtttgtaccttgcaaatgagaggcagccttattcagaacca
gatgtaaggctcctagagaatgagcataagctggtgtactttaagcctaatgggaagttt
tatatgcaagggccttga

>gi568815596r:144289454_144617350|GENSCAN_predicted_peptide_9|78_aa
MVGYEDGEHEKGQRGLPSQEDIAESQEEEGRKHARSSVQEEIRTGVISDLQLSPSVSAAK
KLPFNVVLSSSMLHVSGS

>gi568815596r:144289454_144617350|GENSCAN_predicted_CDS_9|237_bp
atggtgggatatgaagatggagaacatgaaaaaggacaaaggggtttgccttctcaagag
gacattgcagaatctcaggaagaggagggaagaaagcatgctaggagctctgtgcaggag
gagatccgcactggtgttatctctgacctgcagcttagccccagtgtctcagctgccaaa
aagctcccattcaacgttgtgctgagttcatcaatgctccacgtgtcaggatcctaa