GENSCAN 1.0    Date run: 5-Nov-116    Time: 14:37:13

Sequence gi568815590f:30285093_30647420 : 362328 bp : 41.90% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   1019   1168  150  0  0   78  115    49 0.528   5.81
 1.02 Intr +  19247  19446  200  0  2   24   55   127 0.000   1.15
 1.03 Intr +  31234  31371  138  0  0   72   56   126 0.050   7.54
 1.04 Term +  41013  41165  153  2  0   44   45   124 0.299   0.64
 1.05 PlyA +  43691  43696    6                               1.05

 2.00 Prom +  43728  43767   40                              -4.35
 2.01 Init +  47849  48028  180  1  0   83   48   109 0.089   5.63
 2.02 Intr +  65142  65348  207  2  0   95   63   132 0.409   9.75
 2.03 Intr +  66940  67563  624  0  0   29   22   876 0.407  66.61
 2.04 Term +  67651  68250  600  0  0   72   48   925 0.994  80.44
 2.05 PlyA +  68411  68416    6                               1.05

 3.00 Prom +  76770  76809   40                              -4.75
 3.01 Init +  98880  99066  187  2  1   67   36   170 0.523   8.97
 3.02 Intr +  99968 100298  331  0  1   18   78   159 0.134   1.66
 3.03 Intr + 110777 110906  130  2  1   82    2   113 0.191   1.88
 3.04 Term + 137887 138000  114  0  0   95   48   123 0.201   6.59
 3.05 PlyA + 139189 139194    6                               1.05

 4.03 PlyA - 139785 139780    6                               1.05
 4.02 Term - 155520 155393  128  1  2   61   48   142 0.088   4.96
 4.01 Init - 157636 157351  286  1  1  102   89   310 0.081  27.79
 4.00 Prom - 164919 164880   40                              -5.45

 5.00 Prom + 172936 172975   40                              -5.65
 5.01 Init + 188937 188939    3  2  0   91   80     0 0.585  -0.45
 5.02 Intr + 189687 189764   78  2  0  104   77    21 0.656   1.43
 5.03 Intr + 192707 192745   39  1  0  113  110    14 0.920   3.60
 5.04 Intr + 194223 194285   63  2  0   99  101    75 0.983   7.90
 5.05 Intr + 219194 219344  151  1  1   73  103   136 0.653  12.41
 5.06 Intr + 259402 259532  131  2  2  123   91    78 0.989  11.09
 5.07 Intr + 265628 265871  244  1  1   46   41   171 0.633   4.35
 5.08 Intr + 270271 270363   93  2  0   53   56    81 0.109   0.42
 5.09 Intr + 271183 271555  373  2  1   65   21   235 0.105   7.70
 5.10 Intr + 272023 272233  211  1  1   37   58   169 0.523   6.79
 5.11 Intr + 276459 276634  176  2  2   17   81   188 0.919   8.82
 5.12 Intr + 280759 280951  193  1  1  111   32   119 0.910   7.17
 5.13 Intr + 281084 281171   88  1  1   98   89    86 0.927   8.32
 5.14 Intr + 281385 281514  130  1  1   92   82    66 0.910   5.23
 5.15 Intr + 287193 287344  152  0  2   67   89   133 0.955  10.19
 5.16 Intr + 290887 291001  115  2  1   74   38   172 0.814   9.39
 5.17 Term + 291240 291396  157  0  1   19   42   146 0.578  -0.48
 5.18 PlyA + 291670 291675    6                               1.05

 6.04 PlyA - 293247 293242    6                               1.05
 6.03 Term - 293945 293829  117  2  0  106   50   135 0.992   9.06
 6.02 Intr - 295304 295189  116  0  2   47   62   163 0.677   8.85
 6.01 Init - 310438 310336  103  1  1   67   64    68 0.134   2.75
 6.00 Prom - 310876 310837   40                              -6.15

 7.07 PlyA - 311354 311349    6                               1.05
 7.06 Term - 312041 311734  308  0  2  -83   48   344 0.303   7.59
 7.05 Intr - 322058 321965   94  2  1   59  105    75 0.857   4.92
 7.04 Intr - 327389 327207  183  2  0   65   84   224 0.999  18.76
 7.03 Intr - 329623 329516  108  1  0   55   94    97 0.985   6.56
 7.02 Intr - 334840 334709  132  1  0  114   63    40 0.449   4.02
 7.01 Intr - 350031 349940   92  1  2   83  115    66 0.065   7.59

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  19269  19180   90  1  0   94   75    59 0.904   4.25
S.002 Init -  19563  19554   10  0  1   93   94     0 0.817   1.97
S.003 Sngl - 155677 155393  285  1  0   60   48   228 0.902  11.29
S.004 Sngl - 157636 157292  345  1  0  102   40   372 0.842  27.59
S.005 Term + 266089 266273  185  2  2  117   36   140 0.893   8.42
S.006 Sngl - 339704 339186  519  2  0   60   55   224 0.936  12.09
S.007 Intr + 356070 356209  140  0  2   59  103    24 0.935   0.19
S.008 Term + 360354 360505  152  1  2   71   41    80 0.834  -1.31

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590f:30285093_30647420|GENSCAN_predicted_peptide_1|213_aa
XASAKPILAKKLSLNPPARRDCSHFCFLTALESSCSIYNLPPYPPVNAQEACLAVISTLE
KRRETAHTISIMLVKEGMLVAWSCCDKVPQTRWLKQQTFIVSQQSKIKVSAGLLLLRILL
PQQEGDQRDARQVLNTSSQALLEDVANIPGDLETQGLLGTAFSVQEHREEKLGKVDVVSN
KSNEFSWELPILKSEVFSRDVPDTMKDFHLEVE
>gi568815590f:30285093_30647420|GENSCAN_predicted_CDS_1|642_bp
naggccagcgcaaaacccatcttggccaagaagttgtccctaaaccctcctgctagaagg
gactgctcccacttttgctttcttacagctcttgaatcctcctgcagcatttataacctt
ccaccttatcctccagttaatgcacaggaagcgtgtttagcggtaataagtactctggaa
aagagaagggaaacagcgcatacgatttcaataatgctggtcaaggaaggtatgttggtt
gcctggagctgctgtgacaaagtaccacaaacaaggtggcttaaacaacagacatttatt
gtgtcacagcagtccaaaatcaaggtttcggcaggtttgctcctcctgaggatccttctt
ccgcagcaggaaggagaccagagagatgccaggcaggtcttaaacacatcctcccaagcc
ctcttagaggatgttgcaaacatccctggagacctggagacccagggccttctgggaaca
gctttctctgtccaggaacacagagaggaaaaactaggaaaggtggatgtggttagcaac
aagagcaatgagttcagctgggaacttcccatcttgaagagtgaggtatttagcagagat
gtaccagacacaatgaaagatttccacttggaagttgagtaa
>gi568815590f:30285093_30647420|GENSCAN_predicted_peptide_2|536_aa
MPGQKIAFVIQTCIDVLWGKGFLLQSRQQPPGAQQYVPTSPTTETQMNTILIEMEGKTLE
CSEHGYFLLSLELRKQLKGDAAQSLAVNGDVKDHGGVDCDLWHRALGGPSICKAFLFLTS
LTILRSTDKAGQRGNQIGAKFWEVISDEHGIDPTGTYHGDSDLQLDRISVYYSEATDGKY
VPRAILVDLEPGTMDSVRSGPFGQIFRPDNFVFGQSGAGNNWAKGHYTEGAELVDSVLDV
VRKEAESCDCLQGFQLTHSLGGGTGSGMGTLLISKIREEYPDRIMNTFSVVPSPKVSDTV
IESYNATLSVHQLVENTDETYCIDNEALYDICFRTLRLNADLCKLAVNMVPFPRLHFFMP
GFAPLTSHGSQQYRALTVPELTQQVFDAKSMMAACDPRHGRYLTVAADFRGRMSMKEVDE
QMFNVQNKNSSYFVEWIPNNIKTAVCDIPPRGLKMAVTFIGSSTAIQELFKHISEQFTAM
FRRKAFLHWYTGEGMDEMEFTEADSNMNDLVSEYQQYQDATAEEEEDFGEEAKEEA
>gi568815590f:30285093_30647420|GENSCAN_predicted_CDS_2|1611_bp
atgcctggccaaaaaattgcctttgttattcaaacttgcattgatgtactgtggggaaag
ggctttctgctgcagagcagacagcagccaccaggagcccagcagtatgttcctacttcc
ccaaccacagaaacccagatgaatacgattttgatagaaatggaaggaaagacattagag
tgctcagagcatggatattttctgctgtctttggaacttcgcaaacaactcaaaggagat
gcagctcaaagtctcgccgtcaatggcgatgtcaaagaccatggtggggttgactgtgac
ttgtggcatagggctctgggtggccccagcatctgcaaagcattcttgtttttgacgagc
ttaacaattttaagaagcactgataaggctggtcagcgtggcaaccagatcggtgccaag
ttctgggaggtgatcagtgatgaacatggcatcgaccccactggcacctaccacggggac
agcgacctgcagctggaccgcatctccgtgtactacagtgaagccactgatggcaaatat
gttcctcgtgccatactggtggatctagaacctgggaccatggactctgttcgctcaggt
ccttttggccagatctttagaccagacaactttgtatttggtcagtctggggcaggtaac
aactgggccaaaggccactacacagagggcgccgagctggttgattctgtcctggatgtg
gtacggaaggaggcagagagctgtgactgcctgcagggcttccagctcacccactcactg
ggtgggggcacaggctctggaatgggcactctccttatcagcaagatccgagaagaatac
cctgatcgcatcatgaataccttcagtgtggtgccttcacccaaagtgtctgacaccgtg
atcgagtcctacaatgctaccctctccgtccatcagttggtagagaacactgatgagacc
tattgcattgacaacgaggccctctatgatatctgcttccgcactctgaggctcaatgct
gacctctgcaagttggcagttaacatggtccccttcccacgtctccatttctttatgcct
ggctttgcccctctcaccagccatggaagccagcagtatcgagctctcacagtgccggaa
ctcacccagcaggtcttcgatgccaagagcatgatggctgcctgtgacccccgccacggc
cgatacctcaccgtggctgctgacttccgtggtcggatgtccatgaaggaggtcgatgag
cagatgtttaacgtgcagaacaagaacagcagctactttgtggaatggatccccaacaat
atcaagacagctgtctgtgacatcccacctcgtggcctcaagatggcagtcaccttcatc
ggcagtagcacggccatccaggagctcttcaagcacatctcggagcagttcactgccatg
ttccgccggaaggccttcctccactggtacacaggcgagggcatggacgagatggagttc
accgaggctgacagcaacatgaacgacctcgtctctgagtatcagcagtaccaggatgcc
accgcagaagaggaggaggatttcggtgaggaggccaaagaggaggcctaa
>gi568815590f:30285093_30647420|GENSCAN_predicted_peptide_3|253_aa
MGLKPCPEKAGPEWAVPGVPRVLMLSFKQQRKNRVWLGKGGLRARGANPSVKAGKLKVAH
IPALPGPARKDREDEQRRQSREGEHPERGQPSGGGGTGRLGVVAGATRDPVRAAGGARAR
GAAVEEGSGMARAPDAAPQCGPGRVDLGSVCFCVHARLRAPNCVTPRTFRRLLAFVQAIL
YARKALHSDIISLILAIHLDLNSDVTPERPLTSKFKVHFVAIMFIAAIPSGVENQDLAPG
LRAVEMYLAKSMP
>gi568815590f:30285093_30647420|GENSCAN_predicted_CDS_3|762_bp
atgggtttgaagccctgtcctgaaaaagccggcccggagtgggctgtccccggtgtccct
cgagttctcatgctgtcattcaagcagcagaggaagaaccgggtctggctcgggaagggt
gggcttagggccaggggtgcaaatccctcggtaaaagccggcaaactaaaagtcgcacac
atcccagccctgcccggcccggcgaggaaggaccgggaagatgaacaacggcggcaaagc
cgagaaggagaacaccccgagcgaggccaaccttcaggaggaggaggtactgggcggctc
ggtgtggtggcgggggcgacgcgggacccagtgcgggcggccggcggggcgcgggcccgg
ggcgcggcggtggaagaaggttcaggcatggcccgtgccccggacgcagccccacagtgt
ggcccggggagggtggatctcgggagcgtgtgtttttgtgtccacgcgcgtctgcgggcc
ccaaattgcgtaactccaaggacttttcggagacttttggcctttgtgcaagccattctg
tatgcccggaaagccctacactcagacattatcagcctcatccttgccattcatttagat
ctcaattcagatgtcaccccagagagacctttgacctccaagtttaaagttcattttgta
gccatcatgttcattgcagctatcccaagtggggttgagaaccaggacctggctccaggt
ctccgagcagttgaaatgtacctggcaaagtccatgccttag
>gi568815590f:30285093_30647420|GENSCAN_predicted_peptide_4|137_aa
MAASVGRATRSAAAHLTQLPPAPRAQRTSPAQPDEGKRRDADPWRTGPTVNKTGSIPGRL
RGWENAIKMEKAATGCEGLQKQKIKRQKPTFWSCHALFTIARWWKQPKHPFDRRMEKYDG
SLKKEGNSGTRYDMDEF
>gi568815590f:30285093_30647420|GENSCAN_predicted_CDS_4|414_bp
atggcggccagtgtgggccgcgcaaccagaagtgcggccgcgcacctgacccagcttccg
cctgcacctagagctcagcgcaccagcccggctcagccagacgaaggcaaacgaagagat
gcggatccctggaggactggccccaccgtgaacaaaacaggaagcattccaggaagactg
cgggggtgggagaatgcaattaaaatggaaaaggcagcgactggctgcgaaggtttacag
aaacagaagataaagcgccagaaacccacattctggagttgccacgcattattcacaata
gccaggtggtggaagcaacccaaacatccatttgacagaaggatggaaaaatacgatgga
agccttaaaaaggaaggaaattctggcacacgctacgacatggatgaattttga
>gi568815590f:30285093_30647420|GENSCAN_predicted_peptide_5|798_aa
MVRTLFVSGLPLDIKPRELYLLFRPFKGYEGSLIKLTSKQPVGFVSFDSRSEAEAAKNAL
NGIRFDPEIPQTLRLEFAKANTKMAKNKLVGTPNPSTPLPNTVPQFIAREPYELTVPALY
PSSPEVWAPYPLYPAELAPALPPPAFTYPASLHAQLTPALGELIQKRGLPKPGWTRPPEP
PSDRKYIRGGGGWAAESQRRNGPCPHELLYTNPLSLTFPAAALSHFPIGETGERITVEKI
PRVVHVALDVTGCVKCLARSLTQTKCQARNPPSCHSGVRLDQADESLKTGCTLVPNPAGT
CASENVRAVCESWSERLGQGFLLAVTCTESRTMPSTVQLSSPGSPPPPVSPVKHRHVLAS
SGLSAPTHTQAAPVTPLRSTLTQCTQTHHPLLWSSDRWSYLAWSPEETQCTGSLTESWCR
QKLRYVLSLRFLTQASEERTAREEAHGEEEVPRPPCARTRDKHKLTVEEVGETGREAPCQ
AHLRAKSQQRKSSPQRVESGLEKQGAPATSVSQMLQESQAWAFGQSSGSRPQPLGCNHRD
VGVESCLLGLQPPNAPSAKSSHAFLGMSFLAGPQKPPLTCSAKSSNRPVFKTEAAPGGGY
CAPLTGDLSPSQVPGDSARSSLTSAPTRPAREGHPRTATDTFLKPITPVFGAHHVLLSKT
NYMYPTLQTRRQQISFPLSGFGRAKQALAIELRHKEIKGGSRYRGKPENFKRLTDKDRAV
HSQSLERWGWKERDVLRCLSTKPLLNSRATRNGQRREHDFNLVFLVDMFSDTQKNCEETA
SLYSIPRRCCRLGSRPSQ
>gi568815590f:30285093_30647420|GENSCAN_predicted_CDS_5|2397_bp
atggtccggaccctatttgtcagtggccttcctctggatatcaaacctcgggagctctat
ctgcttttcagaccatttaagggctatgagggttctcttataaagctcacatctaaacag
cctgtaggttttgtcagttttgacagtcgctcagaagcagaggctgcaaagaatgctttg
aatggcatccgcttcgatcctgaaattccgcaaacactacgactagagtttgctaaggca
aacacgaagatggccaagaacaaactcgtagggactccaaaccccagtactcctctgccc
aacactgtacctcagttcattgccagagagccatatgagctcacagtgcctgcactttac
cccagtagccctgaagtgtgggccccgtaccctctgtacccagcggagttagcgcctgct
ctacctcctcctgctttcacctatcccgcttcactgcatgcccagcttacgccagcactg
ggagagcttatccaaaagaggggcctccccaaaccaggctggacacggccaccggaaccc
ccttctgacaggaaatacatccggggtggagggggctgggcagctgaatcccagcgacgg
aatggcccttgccctcacgagttactttacacaaacccgctgtccctgacctttccagct
gcagccttatctcactttcctatcggagaaactggagaaagaatcacagtggagaagatt
cccagggtcgtccacgttgccttagatgtcacaggctgtgtgaagtgcttagcgcggtct
ctgacgcagactaaatgtcaggctcggaacccgccttcatgtcactctggagtgcgcctc
gaccaggctgacgagagcctgaagacgggctgcacgctcgtccccaaccctgcaggcacc
tgtgcgagtgagaacgtgagggctgtgtgcgagagctggtcagagaggctggggcagggc
ttccttctcgcagtcacctgcaccgaaagcagaaccatgccatccactgtgcagctctcc
tccccgggcagccctccccctcctgtgtctcctgtgaaacaccgtcacgtcttggcctcg
tcggggctcagcgccccaacccacacccaggccgcaccagtcacaccactgcgctccaca
ctgacccagtgcacacagacccatcaccctctactctggtcatctgatcgctggtcatat
ctggcatggtcgccagaagagacccaatgcacaggcagcctgacggagtcgtggtgcagg
cagaagctccggtatgtcctaagcctgcgtttcctgactcaggccagcgaggaaagaacc
gctcgtgaggaggcccatggggaggaggaggttcccaggccgccttgtgctcggaccagg
gacaagcataaactgaccgttgaggaagttggtgaaactggaagagaggccccttgccaa
gcacatctgagagccaagagtcagcagagaaagagcagcccccagagagtggagagcggg
ctggagaagcagggtgcgccggccacttctgtcagtcagatgcttcaggaaagtcaagca
tgggcctttggacagtcctccgggtctaggccacagcccttgggatgcaaccacagggac
gtgggtgtagagtcctgcttgctgggcctgcagcctcctaatgcgccctcagcgaagagt
tcacatgcttttcttggaatgtccttcctcgctggcccccagaagcctcctcttacttgc
tccgccaagtcctcaaacaggccggtcttcaagaccgaggcagccccaggtggaggttac
tgtgcaccgttaacgggtgatctttctccatcccaggtcccaggtgattctgcccggagc
tctttaacatctgctccaacacggcctgccagggaaggtcatcccaggacagcaactgat
acatttttaaagcccatcaccccagtatttggtgctcaccatgtcctccttagcaagacc
aactacatgtatcccacactgcagacaagacgccagcagatttcattcccattgtcaggc
ttcggcagagctaagcaagcattggccattgagcttaggcacaaagagattaaaggagga
agcagataccggggaaagccagagaatttcaaacggctcacagataaagacagggctgtc
cactcgcagtcccttgaacgctggggctggaaggaacgggatgtcctgcgctgcctttcc
actaagcctctgctcaacagcagagccaccagaaacgggcagaggcgagaacatgacttc
aacttggtgtttctggtggatatgttttcagacacccagaagaactgcgaagagacagct
tctctgtatagcatccctcggagatgctgcagacttggttccaggccatcgcaataa
>gi568815590f:30285093_30647420|GENSCAN_predicted_peptide_6|111_aa
MKDKCSLRETEMERIHCHQTCTTGNIKGSCSGRREFQKLWRSVTVDSMDEEKIEEYLKRQ
GISSMQESGPKKVAPIQRRKKPASQKKRRFKTHNEHLAGVLKDYSDITSSK
>gi568815590f:30285093_30647420|GENSCAN_predicted_CDS_6|336_bp
atgaaggacaaatgctctctcagagaaacagaaatggagagaattcattgccatcagacc
tgcactactggaaatattaaaggaagttgttcaggaagaagagaatttcagaaactgtgg
aggagtgtcactgtagattccatggacgaggagaaaattgaagaatatctgaagcgacag
ggtatttcttccatgcaggaatctggaccaaagaaagtggcccctattcagagaaggaaa
aagcctgcttcacagaaaaagcgacgctttaagactcataacgaacacttggctggagtg
ctgaaggattactctgacattacttccagcaaatag
>gi568815590f:30285093_30647420|GENSCAN_predicted_peptide_7|305_aa
XHSNGSFNLKALSGSSGYKFGVLAKIVNYMKGHLEICNNVSYLWILGLIMPMKSLLSCKV
TQPQVDLDVHVFGGRTRHQRGDTHPLTLDEILDETQHLDIGLKQKQWLMTEALVNNPKIE
VIDGKYAFKPKYNVRDKKALLRLLDQHDQRGLGGILLEDIEEALPNSQKAVKALGDQILF
VNRPDKKKILFFNDKSCQFSVDEVTKEDSVCPRPRLNPNVRESSHPYEANSCGLTGDTGD
RCKAQERGGRPAQMQRKRKTKKAEGRTDTEENILWQTSHYPNTVRSWWYIVGNHGSNKFQ
TTLLF
>gi568815590f:30285093_30647420|GENSCAN_predicted_CDS_7|918_bp
natcatagcaatggatcatttaacttgaaagctttgtcaggaagctctggatataagttt
ggtgttcttgctaagattgtgaattacatgaagggacacctggaaatctgcaataatgtc
tcctatctctggatccttggtttaatcatgccaatgaagtcccttttgtcatgcaaggta
acacagccacaggttgatttggatgtgcatgtttttggagggaggacacggcatcagcga
ggagatacgcatcctctaaccttagatgaaattttggatgaaacacaacatttagatatt
ggactcaagcagaaacaatggctaatgactgaggctttagtcaacaatcccaaaattgaa
gtaatagatgggaagtatgctttcaagcccaagtacaacgtgagagataagaaggcccta
cttaggctcttagatcagcatgaccagcgaggattaggaggaattcttttagaagacata
gaagaagcactgcccaattcccagaaagctgtcaaggctttgggggaccagatactattt
gtaaatcgtcccgataagaagaaaatacttttcttcaatgataagagctgtcagttttct
gtggatgaagtcaccaaggaagacagtgtttgcccaagaccaaggcttaatcctaatgtc
agagaatcttcccacccatacgaggctaacagctgtggactgacaggtgacactggagat
cggtgtaaggcgcaagaacgtggaggcagaccagctcagatgcagcgtaaaaggaagacc
aaaaaagctgaaggtcgaacagacactgaggaaaacatcctctggcaaaccagccactac
cctaacacagttcgaagctggtggtacattgtgggtaaccatggcagcaacaaattccaa
accacgcttctgttctga