GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:25:09 Sequence gi568815583r:66399410_66603530 : 204121 bp : 44.60% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 30827 30886 60 1 0 74 99 49 0.071 3.53 1.02 Intr + 35529 35828 300 2 0 86 113 322 0.871 31.43 1.03 Intr + 37337 37483 147 1 0 112 80 244 0.999 26.43 1.04 Intr + 45247 45298 52 0 1 100 94 40 0.720 4.28 1.05 Intr + 50444 52057 1614 0 0 19 37 549 0.002 31.14 1.06 Intr + 63818 63869 52 0 1 60 98 35 0.004 -0.33 1.07 Intr + 77733 77803 71 0 2 108 92 3 0.004 1.53 1.08 Intr + 82346 82470 125 0 2 134 68 230 0.949 25.90 1.09 Intr + 85581 85721 141 2 0 89 44 57 0.709 1.95 1.10 Term + 91093 91206 114 0 0 95 42 112 0.939 5.87 1.11 PlyA + 92095 92100 6 1.05 2.03 PlyA - 94430 94425 6 1.05 2.02 Term - 95143 95027 117 1 0 68 43 186 0.999 10.64 2.01 Init - 98322 98233 90 0 0 97 55 197 0.985 17.79 2.00 Prom - 99502 99463 40 -11.92 3.08 PlyA - 99942 99937 6 1.05 3.07 Term - 100243 99998 246 1 0 76 37 314 0.981 20.89 3.06 Intr - 100767 100647 121 2 1 69 82 83 0.999 6.30 3.05 Intr - 101696 101540 157 0 1 78 108 83 0.999 8.37 3.04 Intr - 102095 101966 130 2 1 34 51 189 0.559 10.07 3.03 Intr - 102503 102379 125 0 2 57 73 181 0.999 13.80 3.02 Intr - 104120 103949 172 2 1 105 80 235 0.999 23.92 3.01 Init - 105381 105379 3 0 0 98 101 0 0.815 2.30 3.00 Prom - 111573 111534 40 -5.46 4.00 Prom + 115996 116035 40 -1.46 4.01 Init + 119492 119560 69 1 0 91 86 -12 0.650 -0.04 4.02 Intr + 121641 121796 156 2 0 87 99 81 0.978 9.31 4.03 Intr + 130277 130342 66 1 0 80 116 21 0.754 3.20 4.04 Intr + 132838 132994 157 0 1 60 97 50 0.747 2.68 4.05 Term + 134048 134073 26 0 2 87 42 23 0.220 -4.01 4.06 PlyA + 134538 134543 6 1.05 5.12 PlyA - 135444 135439 6 1.05 5.11 Term - 140620 140597 24 1 0 91 44 29 0.297 -2.98 5.10 Intr - 147386 147246 141 2 0 54 91 68 0.686 4.25 5.09 Intr - 152452 152253 200 2 2 53 80 115 0.942 6.27 5.08 Intr - 158474 158313 162 0 0 128 85 134 0.999 17.05 5.07 Intr - 158627 158573 55 2 1 117 105 30 0.779 6.15 5.06 Intr - 161692 161597 96 1 0 82 100 87 0.872 9.51 5.05 Intr - 161906 161778 129 2 0 97 102 153 0.995 18.49 5.04 Intr - 164216 164107 110 0 2 69 94 111 0.992 9.80 5.03 Intr - 164589 164502 88 0 1 99 83 51 0.719 5.24 5.02 Intr - 165430 165267 164 2 2 70 109 119 0.994 11.89 5.01 Init - 165956 165839 118 2 1 109 60 173 0.998 14.96 5.00 Prom - 171870 171831 40 -2.06 6.04 PlyA - 173132 173127 6 1.05 6.03 Term - 175294 175175 120 1 0 89 48 72 0.366 1.77 6.02 Intr - 187901 187585 317 0 2 -68 3 423 0.495 13.98 6.01 Init - 189721 189661 61 1 1 78 110 43 0.972 6.91 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815583r:66399410_66603530|GENSCAN_predicted_peptide_1|891_aa HSCLPDATGEGELYNSLGSLTWSFLSMIGVLLWVDFSGDSIDLCSPLWNRTNLEALQKKL EELELDEQQRKRLEAFLTQKQKVGELKDDDFEKISELGAGNGGVVFKVSHKPSGLVMARK LIHLEIKPAIRNQIIRELQVLHECNSPYIVGFYGAFYSDGEISICMEHMVIKGLTYLREK HKIMHRAWTTRAKLSKILNKILANRIQQHIKKLIHHDQVGFIPGMQGWFNIRKSINVIQH INRAKDKNHMIISIDAEKAFDKIQQPFMLKTLNKLGIDGTYFKIIRAIYDIPTANIIPNG QKLEAFPLKTGTRQGCPLSPLLFNIVLEVLARAIRQEKEIKGIQLGKEEVKLSLFADDMI VYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTNNRQTESQIMSELPFTIASKR IKYLGIQLTRDVKDLFKENYKPLLKEIKEDTNKWKNIPCSWVGRINIVKMAILPKVIYRF NAIPIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPDFKLYYKATVTK TAWYWYQNRDIDQWNRTEPSEIMPHIYNYLIFDKPEKNKQWGKDSLFNKWCWENWLAICR KLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGVGKDFMSKTPKAMATK AKIDKWDLIKLKSFCTAKETTIRVNRQPTTWEKIFATYSSDKGLISRIYNELKQIYKKKT TPSKNWKDSRIYVKREDISIEQQCFSLERVGEAGADSQGWWQAVLYVKPSNILVNSRGEI KLCDFGVSGQLIDSMANSFVGTRSYMSPERLQGTHYSVQSDIWSMGLSLVEMAVGRYPIP PPDAKELELMFGCQVHAFIKRSDAEEVDFAGWLCSTIGLNQPSTPTHAAGV >gi568815583r:66399410_66603530|GENSCAN_predicted_CDS_1|2676_bp cattcctgcttgccagatgccacaggggaaggagaactttacaactccctgggctctctg acctggagctttctttccatgataggagtacttctttgggttgacttctctggtgacagt attgacttgtgctccccactttggaacaggaccaacttggaggccttgcagaagaagctg gaggagctagagcttgatgagcagcagcgaaagcgccttgaggcctttcttacccagaag cagaaggtgggagaactgaaggatgacgactttgagaagatcagtgagctgggggctggc aatggcggtgtggtgttcaaggtctcccacaagccttctggcctggtcatggccagaaag ctaattcatctggagatcaaacccgcaatccggaaccagatcataagggagctgcaggtt ctgcatgagtgcaactctccgtacatcgtgggcttctatggtgcgttctacagcgatggc gagatcagtatctgcatggagcacatggtaataaaaggcctgacatatctgagggagaag cacaagatcatgcacagagcctggacaacaagagcaaaactgtcaaaaatcctcaataaa atactggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggc ttcatccctgggatgcaaggctggttcaatatacgcaaatcaataaatgtaatccagcat ataaacagagccaaagacaaaaaccacatgattatctcaatagatgcagaaaaagccttt gacaaaattcaacaacccttcatgctaaaaactctcaataaattaggtattgatgggacg tatttcaaaataataagagctatctatgacatacccacagccaatatcataccgaatggg caaaaactggaagcattccctttgaaaactggcacaagacagggatgccctctctcacca ctcctattcaacatagtgttggaagttctggccagggcaatcaggcaggagaaggaaata aagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagacgacatgatt gtttatctagaaaaccccatcgtctcagcccaaaatctccttaagctgataagcaacttc agcaaagtctcaggatacaaaatcaatgtacaaaaatcacaagcattcttatacaccaac aacagacaaacagagagccaaatcatgagtgaactcccattcacaattgcttcaaagaga ataaaatacctaggaatccaacttacaagggatgtgaaggacctcttcaaggagaactac aaaccactgctcaaggaaataaaagaggacacaaacaaatggaagaacattccatgctca tgggtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttacagattc aatgccatccccatcaagctaccaatgactttcttcacagaattggaaaaaactacttta aagttcatatggaaccaaaaaagagcccgcatcgccaagtcaatcctaagccaaaagaac aaagctggaggcatcacactacctgacttcaaactatactacaaggctacagtaaccaaa acagcatggtactggtaccaaaacagagatatagatcaatggaacagaacagagccctca gaaataatgccacatatctacaactatctgatctttgacaaacctgagaaaaacaagcaa tggggaaaggattccctatttaataaatggtgctgggaaaactggctagccatatgtaga aagctgaaactggatcccttccttacaccttatacaaaaatcaattcaagatggattaaa gatttaaacgttagacctaaaaccataaaaaccctagaagaaaacctaggcattaccatt caggacataggcgtgggcaaggacttcatgtccaaaacaccaaaagcaatggcaacaaaa gccaaaattgacaaatgggatctaattaaactcaagagcttctgcacagcaaaagaaact accatcagagtgaacaggcaacctacaacatgggagaaaattttcgcaacctactcatct gacaaagggctaatatccagaatctacaatgaactcaaacaaatttacaagaaaaaaaca accccatcaaaaaactggaaggacagtaggatatacgtgaagagagaagatatctcgatt gaacagcagtgtttttccctggagcgggtgggagaagctggagctgacagccagggatgg tggcaagctgtgttatatgtcaagccctccaacatcctagtcaactcccgtggggagatc aagctctgtgactttggggtcagcgggcagctcatcgactccatggccaactccttcgtg ggcacaaggtcctacatgtcgccagaaagactccaggggactcattactctgtgcagtca gacatctggagcatgggactgtctctggtagagatggcggttgggaggtatcccatccct cctccagatgccaaggagctggagctgatgtttgggtgccaggttcatgcttttatcaag agatctgatgctgaggaagtggattttgcaggttggctctgctccaccatcggccttaac cagcccagcacaccaacccatgctgctggcgtctaa >gi568815583r:66399410_66603530|GENSCAN_predicted_peptide_2|68_aa MLSRLQELRKEEETLLRLKAALHDQLNRLKMLVHVDNEASINQTTLELSTKSHVTEEEEE EEEEESDS >gi568815583r:66399410_66603530|GENSCAN_predicted_CDS_2|207_bp atgctgagccggcttcaggaactgcgcaaggaggaggagacgctgctgcggttgaaggca gccctgcacgaccagctgaaccgcctcaagatgttggtgcatgtagacaatgaagcatca atcaaccaaacaaccctggagctgagcacaaagagtcatgtgacggaagaggaggaggag gaagaggaagaagaatcagattcctaa >gi568815583r:66399410_66603530|GENSCAN_predicted_peptide_3|317_aa MACARPLISVYSEKGESSGKNVTLPAVFKAPIRPDIVNFVHTNLRKNNRQPYAVSELAGH RIEEVPELPLVVEDKVEGYKKTKEAVLLLKKLKAWNDIKKVYASQRMRAGKGKMRNRRRI QRRGPCIIYNEDNGIIKAFRNIPGITLLNVSKLNILKLAPGGHVGRFCIWTESAFRKLDE LYGTWRKAASLKSNYNKKIHRRVLKKNPLKNLRIMLKLNPYAKTMRRNTILRQARNHKLR VDKAAAAAAALQAKSDEKAAVAGKKPVVGKKGKKAAVGVKKQKKPLVGKKAAATKKPAPE KKPAEKKPTTEEKKPAA >gi568815583r:66399410_66603530|GENSCAN_predicted_CDS_3|954_bp atggcgtgtgctcgcccactgatatcggtgtactccgaaaagggggagtcatctggcaaa aatgtcactttgcctgctgtattcaaggctcctattcgaccagatattgtgaactttgtt cacaccaacttgcgcaaaaacaacagacagccctatgctgtcagtgaattagcaggtcat cgtattgaggaagttcctgaacttcctttggtagttgaagataaagttgaaggctacaag aagaccaaggaagctgttttgctccttaagaaacttaaagcctggaatgatatcaaaaag gtctatgcctctcagcgaatgagagctggcaaaggcaaaatgagaaaccgtcgccgtatc cagcgcaggggcccgtgcatcatctataatgaggataatggtatcatcaaggccttcaga aacatccctggaattactctgcttaatgtaagcaagctgaacattttgaagcttgctcct ggtgggcatgtgggacgtttctgcatttggactgaaagtgctttccggaagttagatgaa ttgtacggcacttggcgtaaagccgcttccctcaagagtaactacaacaagaagatccat cgcagagtcctaaagaagaacccactgaaaaacttgagaatcatgttgaagctaaaccca tatgcaaagaccatgcgccggaacaccattcttcgccaggccaggaatcacaagctccgg gtggataaggcagctgctgcagcagcggcactacaagccaaatcagatgagaaggcggcg gttgcaggcaagaagcctgtggtaggtaagaaaggaaagaaggctgctgttggtgttaag aagcagaagaagcctctggtgggaaaaaaggcagcagctaccaagaaaccagcccctgaa aagaagcctgcagagaagaaacctactacagaggagaagaagcctgctgcataa >gi568815583r:66399410_66603530|GENSCAN_predicted_peptide_4|157_aa MAHNPNMTHLKINLPVTALPPLWVTSKGFAQYELFKSSALDDTITASQTAIALDISWSPV DEILQIPPLSSTATLLCKVRQVPLLFLCPNILICKVKLHSGSNSLLSKLIHQSYHGTMDT VSLSGTIPVQMLLEIGLDKLKKDYISFFIAHSETSES >gi568815583r:66399410_66603530|GENSCAN_predicted_CDS_4|474_bp atggctcacaatcctaatatgacccatttgaagattaatctgccagttactgcccttcct cccctttgggtaacatccaaaggctttgcccagtatgagctctttaagtcctctgccttg gatgatacaatcacagcatcacaaactgcgatcgctttggatatttcctggagtcctgtg gatgagattcttcaaatccctccactctcttcaactgcaactctgctgtgtaaagttcgg caggttcctttacttttcttgtgccccaacatcctcatctgtaaagtaaagctccatagt ggaagtaacagtttactaagtaagctcattcatcagtcttatcatggaaccatggacaca gtttctctcagtgggactattccagttcaaatgcttttggaaattggtttggacaaacta aagaaagattatatcagttttttcatagcccattcagaaacttcagaaagttga >gi568815583r:66399410_66603530|GENSCAN_predicted_peptide_5|428_aa MKPVWVATLLWMLLLVPRLGAARKGSPEEASFYYGTFPLGFSWGVGSSAYQTEGAWDQDG KGPSIWDVFTHSGKGKVLGNETADVACDGYYKVQEDIILLRELHVNHYRFSLSWPRLLPT GIRAEQVNKKGIEFYSDLIDALLSSNITPIVTLHHWDLPQLLQVKYGGWQNVSMANYFRD YANLCFEAFGDRVKHWITFSDPRAMAEKGYETGHHAPGLKLRGTGLYKAAHHIIKAHAKA WHSYNTTWRSKQQGLVGISLNCDWGEPVDISNPKDLEAAERYLQFCLGWFANPIYAGDYP QVMKDYIAIKDGANIKGYTSWSLLDKFEWEKGYSDRYGFYYVEFNDRNKPRYPKASVQYY KKIIIANGFPNPREQIDIDNVERTKRKVLITKYLTKRLKCEHKYLIEDFISTLHLKCTWL QPFSEVFM >gi568815583r:66399410_66603530|GENSCAN_predicted_CDS_5|1287_bp atgaagccagtgtgggtcgccacccttctgtggatgctactgctggtgcccaggctgggg gccgcccggaaggggtccccagaagaggcctccttctactatggaaccttccctcttggc ttctcctggggcgtgggcagttctgcctaccagacggagggcgcctgggaccaggacggg aaagggcctagcatctgggacgtcttcacacacagtgggaaggggaaagtgcttgggaat gagacggcagatgtagcctgtgacggctactacaaggtccaggaggacatcattctgctg agggaactgcacgtcaaccactaccgattctccctgtcttggccccggctcctgcccaca ggcatccgagccgagcaggtgaacaagaagggaatcgaattctacagtgatcttatcgat gcccttctgagcagcaacatcactcccatcgtgaccttgcaccactgggatctgccacag ctgctccaggtcaaatacggtgggtggcagaatgtgagcatggccaactacttcagagac tacgccaacctgtgctttgaggcctttggggaccgtgtgaagcactggatcacgttcagt gatcctcgggcaatggcagaaaaaggctatgagacgggccaccatgcgccgggcctgaag ctccgcggcaccggcctgtacaaggcagcacaccacatcattaaggcccacgccaaagcc tggcattcttataacaccacgtggcgcagcaagcagcaaggtctggtgggaatttcattg aactgtgactggggggaacctgtggacattagtaaccccaaggacctagaggctgccgag agatacctacagttctgtctgggctggtttgccaaccccatttatgccggtgactacccc caagtcatgaaggactacattgctataaaagatggtgctaatataaaggggtatacttcc tggtctctgttggataagtttgaatgggagaaaggatactcagatagatatggattctac tatgttgaatttaacgacagaaataagcctcgctatccaaaggcttcagttcaatattac aagaagattatcattgccaatgggtttcccaatccaagagagcaaatcgacattgataat gtagaaagaactaaacgtaaggtacttattacaaagtatttgacaaagagactaaaatgt gaacataaataccttatagaggacttcatcagcacacttcacttgaaatgcacctggctg cagccatttagtgaagtcttcatgtga >gi568815583r:66399410_66603530|GENSCAN_predicted_peptide_6|165_aa MTGSQQWDVRSNGCYFLVKAEEREEKEEEGEEEKEGEEDKEEEKEEEKEEEKRKKEKEKQ EEEGKKKRKKEEEEKEEEEEEGEEEGEKEEEGEKEEEEGEKEEEEEEGENGEGEKGGRGG GEGGREPPPFTLQKLEAEDLAGIFSSQPWGSPQVEMGGEKCKARS >gi568815583r:66399410_66603530|GENSCAN_predicted_CDS_6|498_bp atgactggttctcagcaatgggatgtgagaagtaatgggtgttacttcctggtcaaggca gaagagagagaagagaaggaggaggagggggaggaagagaaggagggggaggaagataag gaggaagagaaggaggaggagaaggaagaggagaagaggaagaaagagaaggagaagcag gaggaggaggggaagaagaagaggaagaaagaggaggaggagaaggaggaggaggaagag gagggggaggaggaaggggagaaggaagaggaaggggagaaggaggaagaggaaggggag aaggaggaggaggaagaggagggggagaatggggagggggagaaggggggaaggggaggg ggagaagggggaagggaacctcctcctttcacccttcagaaactggaagcagaggacttg gctggcatcttcagcagccagccatggggttccccacaggtggaaatggggggagaaaaa tgcaaggcacgatcctga