GENSCAN 1.0    Date run: 4-Nov-116    Time: 23:17:37

Sequence gi568815590r:29237029_29450278 : 213250 bp : 43.46% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.06 Intr -   8411   8318   94  0  1  121  117    58 0.616  11.02
 1.05 Intr -  26061  25952  110  2  2   25   78   133 0.047   5.93
 1.04 Intr -  32796  32606  191  0  2   77   82    54 0.016   2.08
 1.03 Intr -  36245  36102  144  2  0   68   87    52 0.719   3.58
 1.02 Intr -  38857  38826   32  2  2  105   56    10 0.152  -2.65
 1.01 Init -  62322  62100  223  0  1   20   72   144 0.273   4.72
 1.00 Prom -  65786  65747   40                              -6.66

 2.00 Prom +  68101  68140   40                              -6.36
 2.01 Init +  68801  68971  171  1  0   76   88    58 0.873   4.04
 2.02 Intr +  70080  70455  376  2  1   81   53   141 0.235   4.49
 2.03 Intr +  72148  73062  915  2  0   58   80   194 0.312   6.84
 2.04 Intr +  73936  74161  226  2  1  101   -6    89 0.298  -2.16
 2.05 Intr +  74683  74906  224  1  2   63   44   126 0.233   3.47
 2.06 Term +  75716  75873  158  0  2  105   49    92 0.711   5.10
 2.07 PlyA +  77906  77911    6                               1.05

 3.03 PlyA -  78210  78205    6                              -0.45
 3.02 Term -  78481  78336  146  2  2   96   36   125 0.623   6.17
 3.01 Init -  83294  83207   88  2  1   47  116    21 0.240  -0.34
 3.00 Prom -  90941  90902   40                              -2.96

 4.05 PlyA -  91851  91846    6                               1.05
 4.04 Term - 100383  99998  386  1  2   63   43   460 0.999  34.05
 4.03 Intr - 101473 101254  220  1  1  104   74   301 0.999  28.17
 4.02 Intr - 103215 103070  146  1  2  141   74   152 0.999  19.10
 4.01 Init - 113250 112818  433  0  1  108   92   978 0.628  96.57
 4.00 Prom - 121842 121803   40                              -4.56

 5.03 PlyA - 122313 122308    6                               1.05
 5.02 Term - 128360 128091  270  2  0   87   43   148 0.844   5.68
 5.01 Init - 129388 129263  126  1  0   77   79    49 0.756   3.06
 5.00 Prom - 130191 130152   40                              -2.96

 6.09 PlyA - 130939 130934    6                               1.05
 6.08 Term - 132181 132075  107  2  2   84   48    43 0.267  -1.53
 6.07 Intr - 135366 135256  111  1  0   95  101    38 0.065   6.15
 6.06 Intr - 144256 144109  148  1  1   59   90    87 0.016   5.81
 6.05 Intr - 148609 148485  125  2  2   32   94    49 0.008   0.20
 6.04 Intr - 153738 153609  130  0  1   85   65   117 0.070   9.37
 6.03 Intr - 166345 166274   72  1  0   66   81    36 0.325   0.30
 6.02 Intr - 193354 193295   60  1  0   97   95     2 0.051   0.73
 6.01 Init - 210076 210014   63  1  0   89   97    23 0.120   4.45

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  26006  25952   55  2  1   76   78    92 0.891   8.45
S.002 Term -  34870  34691  180  1  0  123   37    93 0.898   5.31
S.003 Init + 143864 144093  230  1  2   71  111   123 0.954  10.74

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590r:29237029_29450278|GENSCAN_predicted_peptide_1|265_aa
MSVCSPCDIESNIILSPTGYGEQYHRRVCTTYDIESNILSPPEYCEHYYRGVCTPCNVKS
NIILLLLDVTNNITGSGLTSTTHLQGTSLTFNIQEDPSKHLNSLKSKSIQFSLLNPQGCD
FTFSGEMQSTYDKVEQPKIMTKSRCGPAAAGCWGVSVKAVDPGEHQTLHEEAGQVVVMIL
LPGQGDQGFAFRYLLLRVAGGRGRRKPNEFLGGCRMGDSKVKVAVRIRPMNRRETDLHTK
CVVDVDANKVILNPVNTNLSKGDAR

>gi568815590r:29237029_29450278|GENSCAN_predicted_CDS_1|795_bp
atgagtgtgtgcagcccctgcgatatcgagagtaatatcatcctctcgcccactggatat
ggcgaacaatatcacaggagggtgtgcaccacctacgatattgaaagtaatatcctctcg
ccccctgaatattgcgaacattattacaggggggtgtgtactccctgcaatgttaaaagt
aacatcatcctcttgctcctggatgttacgaacaatatcacaggctcaggcctcacctcc
accacccacctgcaggggacatctttaaccttcaacatccaagaagatccttctaaacat
ctcaattccttaaaatccaagtcaattcaattcagcctcctaaacccacagggttgtgac
tttaccttctcaggagaaatgcaatccacctatgacaaagtggagcagccaaagatcatg
acaaagtccaggtgtggccctgcggcggctggttgctggggagtgtcagtaaaggctgtg
gacccaggagagcaccagacactacatgaagaggcagggcaagtagtggtaatgatctta
ttgcctggacaaggggaccagggctttgcattcaggtatctcttactcagagtcgccggt
ggccgcggcagacggaagccgaacgagttcctcggcggctgcaggatgggggactccaaa
gtgaaagtggcggtgcggatacgacccatgaaccggcgagagactgacttgcataccaaa
tgtgtggtggatgtggatgcaaacaaggttattcttaatcctgtaaatacgaatctttcc
aaaggagatgcccgn

>gi568815590r:29237029_29450278|GENSCAN_predicted_peptide_2|689_aa
MQLQGPDNKLVDPAAMSGTLKGMLPKRDLPIYPIYPDLCPLGPNACETNFLSPLFSEKEI
SKEISKGPQKPLGYRLCPLQAIGAGEFGPTRVHVPFSLSDLKQIKADLGKFSDDPDRHID
VLQGLGQTFDLAWRDVMLLLDQTLAFYEKNVALAAAREFGDTWYLSQVNDRMTAEERDKF
PTANQGYKVSRSKAQLCLQQVKYLGLILAKGTRALSKERIQPILAYPCPKTLKQSRGFLG
ITAFWRLWIPRYSEIARPLYSLIKETQRANTHLVEWEPEAETAFKALKQALVQAPVLSLP
TGQNFSLYVTERARIALGVLTQTRGTTPQPVAYLSKEIDVVAKGWPHCLRVVAAVAVLVS
EASKIIQGKDLTVWTTHDVNGILGAKGSLWLSDNCLLRYQALLLEGLVLQIRMCVALNPA
TFLPEDGEPIEHDCQQIIVQTYATRDDLLDVPLTIPDLNLYTDGSSVVENGIQRTGYTIV
SDVTILESPVAAILLLLAFGACIFNLLVKFVSSRIEAFELQMVLQMEPQMSSTNNFYRGP
MDRPAGTSPGLESSPLKDTTTAGAWDLQPAMPEPPTPSVGSCVARASPMSAAPCSMAPSP
IDHPRAEECGHGTGLAGSSPAAPVRDPLGEASWAPESDIRNNITVGVYTPCDIGSNNIFS
PLDIMNNITKGAYTSCDIGSNIILSVPGY

>gi568815590r:29237029_29450278|GENSCAN_predicted_CDS_2|2070_bp
atgcagctccagggtcccgacaacaagttggttgaccctgcggccatgagtggaactctc
aaaggcatgttgcccaagcgagacttgcccatctatcctatctatcctgacctttgcccc
ctgggtcctaatgcctgtgagacaaacttcctctcgcctctcttctctgagaaggaaata
agcaaagaaatctccaaaggtccacaaaaacccctgggctatcgattatgtccccttcaa
gctataggggcaggggaatttggcccaacccgggtacatgtccccttctccctctctgat
ttaaagcagatcaaggcagacctggggaagttttcagatgatcctgataggcacatagat
gtcctacagggtctagggcaaacctttgacctcgcttggagagatgtcatgctactgtta
gatcaaaccctggccttttatgaaaagaatgtggctttagctgcagcccgagagtttgga
gatacctggtatcttagtcaagtaaatgatagaatgacagccgaagaaagggacaaattc
cctactgctaatcaagggtacaaggtgtctaggtcgaaggcccagctttgcctacagcag
gttaaatatctaggcctaatcttagccaaagggaccagggccctcagcaaggaacgaata
cagcctatactggcttatccttgccctaagacattaaaacagtcgagggggttccttgga
attaccgccttttggcgactatggatccccagatacagcgagatagccaggcccctctat
agtctaatcaaggaaacccagagggcaaatactcatctagtagaatgggaaccagaggca
gaaacagccttcaaagccttaaagcaggccttagtacaagctccagttttaagccttccc
acaggacagaacttctctttatacgtcacagagagagccaggatagctcttggagtcctc
actcagactcgtgggacaaccccacaaccagtggcatacctaagtaaggaaattgatgta
gtagcaaaaggctggcctcactgtttaagggtagttgcagcagtggctgtcttagtgtca
gaggccagcaaaataatacaaggaaaggatctcactgtctggactactcatgatgtaaat
ggcatactaggtgccaaaggaagtttatggctatcagacaactgcctacttagataccag
gcactactccttgagggactggtgcttcaaatacgcatgtgcgtggccctcaaccctgcc
acttttctcccagaggatggggaaccaatcgagcatgactgccaacaaattatagtccag
acttatgccacccgagatgatctcttagatgtccccttaactattcctgaccttaaccta
tataccgatggaagttcagttgtggagaatgggatacaaaggacaggttacaccatagtt
agtgatgtaaccatacttgaaagtcccgtggcagccatcttgctgttacttgcctttggg
gcctgtatttttaaccttcttgtcaaatttgtttcctctagaatcgaggccttcgagcta
cagatggtcttacaaatggaaccccaaatgagttcaactaacaacttctaccgaggaccc
atggaccgacccgctggcacttcccctggcctagagagttcccctctgaaggacactaca
actgcaggggcttgggacctgcagcccgccatgcctgagcctcccacgccctccgtgggc
tcctgtgtagcccgagcctccccgatgagtgccgccccctgctccatggcacccagtccc
attgaccacccaagggctgaggagtgtgggcacggcacgggactggcaggcagctcccct
gcagcccctgtgcgggatccactgggtgaagccagctgggctcctgagtctgatattagg
aacaatattacagtgggggtgtacaccccctgtgatattgggagtaataatatcttctcc
ccccttgatattatgaacaatatcacaaagggggcgtacacctcctgcgatattgggagt
aatatcatcctctctgtccctggatattag

>gi568815590r:29237029_29450278|GENSCAN_predicted_peptide_3|77_aa
MYVSLNTLMCTPPHLPLLQFLERAGHPMQGGEDGQPDSPQSGRKRGYNAERTSAQVVKIV
KMALLISSFRAADYLHP

>gi568815590r:29237029_29450278|GENSCAN_predicted_CDS_3|234_bp
atgtacgtaagcttgaatacgttgatgtgcaccccgccacatctcccccttcttcaattc
ttagagcgtgctggtcatccaatgcaaggcggagaagatgggcagcctgactcaccgcaa
tccggcagaaaaagaggctacaacgcggagcgtaccagcgctcaggtggtcaaaatagtt
aaaatggccttgctcatatcctctttccgggcagcggactacctgcacccataa

>gi568815590r:29237029_29450278|GENSCAN_predicted_peptide_4|394_aa
MVTMEELREMDCSVLKRLMNRDENGGGAGGSGSHGTLGLPSGGKCLLLDCRPFLAHSAGY
ILGSVNVRCNTIVRRRAKGSVSLEQILPAEEEVRARLRSGLYSAVIVYDERSPRAESLRE
DSTVSLVVQALRRNAERTDICLLKGGYERFSSEYPEFCSKTKALAAIPPPVPPSATEPLD
LGCSSCGTPLHDQGGPVEILPFLYLGSAYHAARRDMLDALGITALLNVSSDCPNHFEGHY
QYKCIPVEDNHKADISSWFMEAIEYIDAVKDCRGRVLVHCQAGISRSATICLAYLMMKKR
VRLEEAFEFVKQRRSIISPNFSFMGQLLQFESQVLATSCAAEAASPSGPLRERGKTPATP
TSQFVFSFPVSVGVHSAPSSLPYLHSPITTSPSC

>gi568815590r:29237029_29450278|GENSCAN_predicted_CDS_4|1185_bp
atggtgacgatggaggagctgcgggagatggactgcagtgtgctcaaaaggctgatgaac
cgggacgagaatggcggcggcgcgggcggcagcggcagccacggcaccctggggctgccg
agcggcggcaagtgcctgctgctggactgcagaccgttcctggcgcacagcgcgggctac
atcctaggttcggtcaacgtgcgctgtaacaccatcgtgcggcggcgggctaagggctcc
gtgagcctggagcagatcctgcccgccgaggaggaggtacgcgcccgcttgcgctccggc
ctctactcggcggtcatcgtctacgacgagcgcagcccgcgcgccgagagcctccgcgag
gacagcaccgtgtcgctggtggtgcaggcgctgcgccgcaacgccgagcgcaccgacatc
tgcctgctcaaaggcggctatgagaggttttcctccgagtacccagaattctgttctaaa
accaaggccctggcagccatcccacccccggttccccccagtgccacagagcccttggac
ctgggctgcagctcctgtgggaccccactacacgaccaggggggtcctgtggagatcctt
cccttcctctacctcggcagtgcctaccatgctgcccggagagacatgctggacgccctg
ggcatcacggctctgttgaatgtctcctcggactgcccaaaccactttgaaggacactat
cagtacaagtgcatcccagtggaagataaccacaaggccgacatcagctcctggttcatg
gaagccatagagtacatcgatgccgtgaaggactgccgtgggcgcgtgctggtgcactgc
caggcgggcatctcgcggtcggccaccatctgcctggcctacctgatgatgaagaaacgg
gtgaggctggaggaggccttcgagttcgttaagcagcgccgcagcatcatctcgcccaac
ttcagcttcatggggcagctgctgcagttcgagtcccaggtgctggccacgtcctgtgct
gcggaggctgctagcccctcgggacccctgcgggagcggggcaagacccccgccaccccc
acctcgcagttcgtcttcagctttccggtctccgtgggcgtgcactcggcccccagcagc
ctgccctacctgcacagccccatcaccacctctcccagctgttag

>gi568815590r:29237029_29450278|GENSCAN_predicted_peptide_5|131_aa
MPAAREGAKEEEGGITDGSGTGRCQSEVHQYVCFSTQSLSQVKVLLPKVPQMPLPLRALD
LKEAPTLSSTEPWGRCVDGTWCTDSSSSASSFRERSSGDVLHTLPSTAKEGHVLPTNLAL
SLFFPTFFNCS

>gi568815590r:29237029_29450278|GENSCAN_predicted_CDS_5|396_bp
atgccagctgcaagggagggggccaaagaagaggaaggaggaatcacagatggctctggg
acaggccgctgtcaatctgaagtccaccaatatgtatgcttctccacgcagtcactctct
caggtgaaggtgttgctgcccaaagtgccccagatgccgctgcccctcagggccctggac
ctcaaggaagcccccactctcagcagcacagagccatgggggagatgtgtggatggcact
tggtgcaccgattccagcagctctgcatcctcctttagagagagaagctccggggacgtc
ctccacacactgccttctacagcaaaggagggtcacgttcttccaactaacctggcactc
tccctcttcttccccaccttcttcaactgctcttag

>gi568815590r:29237029_29450278|GENSCAN_predicted_peptide_6|271_aa
MAKTGLFEDLAILPPGFSEGQKIAKSIMQLATISSCPCKYPNIWILKLQPEDQILIGGSA
LEVLLPHLRNTGVDLFPESAALTTQQLTARADGADSCYYPELLRRDARVCERERIIQDST
EKEKKKKTEFLSKTSAQTSPFLEPRQGVVWLVFCKPSERGEKVFPWPLQFWHCEQDNQIH
SAFLEAAVKGTQDLTSWQKDSQCTSICSTQDITDCQRCHTVLLWLFLVVIFAPQGDVNSQ
HGGLIDPGKSNPAHVTLPCKTLQRLPSPNGQ

>gi568815590r:29237029_29450278|GENSCAN_predicted_CDS_6|816_bp
atggctaaaactggcctatttgaagatttggctatactccctcctggtttctcagaaggt
cagaaaatagccaaaagcatcatgcaattggccaccatatcatcttgtccttgtaaatat
cctaatatctggattctgaaactccaaccagaagaccagatcctgattgggggatctgcc
cttgaggtattactgcctcacctcagaaacacaggtgtggacctgttcccagagtcagct
gctctgaccactcagcagctgacagcacgagctgatggtgctgacagctgttattaccca
gagctcctaaggagagatgccagagtatgtgaaagagaaagaatcatccaagattccaca
gagaaggagaaaaagaagaaaactgaatttctctccaaaacctctgcacagacaagtccc
tttctggaaccaaggcaaggggtggtgtggttggtgttctgcaaaccttcagagcgtggc
gagaaagttttcccttggcccctgcagttttggcactgtgagcaggacaaccaaatccac
tctgcttttctggaagctgcagtcaagggaacccaggacctgacaagctggcagaaggac
tcccagtgtacttccatctgcagcacacaggacatcacagattgtcaacgctgtcacaca
gtactgctctggttattcctcgtggttatctttgctcctcagggagatgtcaattctcaa
catggtggcctgattgatcctggaaaatccaatccagctcatgtcactcttccgtgcaaa
actctccaaaggcttccatcacccaacggccaatga