GENSCAN 1.0 Date run: 6-Nov-116 Time: 09:01:33 Sequence gi568815595f:12468870_12682250 : 213381 bp : 44.06% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3431 3574 144 1 0 96 32 85 0.395 4.18 1.02 Intr + 4300 4397 98 0 2 123 63 9 0.495 0.71 1.03 Intr + 4794 4903 110 0 2 64 85 53 0.563 2.53 1.04 Intr + 7394 7480 87 0 0 81 66 83 0.585 5.34 1.05 Intr + 15111 15196 86 1 2 67 85 44 0.277 1.54 1.06 Intr + 15392 15648 257 1 2 82 116 127 0.769 11.14 1.07 Intr + 15869 15979 111 2 0 81 41 68 0.241 0.89 1.08 Term + 18174 18396 223 0 1 81 41 63 0.098 -2.71 1.09 PlyA + 18801 18806 6 1.05 2.00 Prom + 20331 20370 40 -2.16 2.01 Init + 20932 21120 189 0 0 65 106 197 0.607 18.21 2.02 Intr + 23267 23348 82 1 1 57 98 23 0.457 -0.59 2.03 Intr + 28255 28372 118 2 1 104 77 22 0.487 2.22 2.04 Intr + 34393 34915 523 1 1 98 100 282 0.766 23.35 2.05 Intr + 50190 50328 139 2 1 37 102 142 0.734 10.54 2.06 Intr + 60019 60055 37 2 1 121 58 41 0.992 1.72 2.07 Intr + 60893 61004 112 2 1 105 87 81 0.991 10.08 2.08 Intr + 65390 65429 40 1 1 60 90 64 0.565 1.60 2.09 Term + 65611 65621 11 2 2 102 39 7 0.422 -4.44 2.10 PlyA + 67603 67608 6 1.05 3.03 PlyA - 67800 67795 6 1.05 3.02 Term - 71564 71324 241 1 1 74 54 283 0.991 19.00 3.01 Init - 73064 72991 74 2 2 78 86 75 0.668 6.84 3.00 Prom - 74296 74257 40 -5.86 4.00 Prom + 85626 85665 40 -4.96 4.01 Init + 88200 88363 164 2 2 75 78 117 0.790 6.62 4.02 Intr + 93837 94025 189 0 0 60 101 76 0.793 4.80 4.03 Intr + 100006 100128 123 1 0 109 -14 92 0.303 0.80 4.04 Intr + 101202 101383 182 0 2 76 81 37 0.466 1.31 4.05 Intr + 103200 103435 236 1 2 91 85 359 0.350 33.31 4.06 Intr + 105923 106137 215 1 2 82 97 143 0.960 12.11 4.07 Intr + 107762 107872 111 2 0 82 98 59 0.981 5.79 4.08 Intr + 112939 113083 145 1 1 134 86 96 0.967 14.28 4.09 Term + 113247 113384 138 2 0 66 43 206 0.999 11.86 4.10 PlyA + 114477 114482 6 1.05 5.15 PlyA - 114753 114748 6 1.05 5.14 Term - 115788 115645 144 0 0 59 33 130 0.981 2.51 5.13 Intr - 116112 115978 135 0 0 98 110 36 0.982 7.56 5.12 Intr - 116384 116253 132 2 0 20 111 195 0.756 15.84 5.11 Intr - 118768 118645 124 0 1 118 -22 13 0.093 -6.11 5.10 Intr - 122105 121929 177 1 0 130 64 156 0.266 16.43 5.09 Intr - 122923 122839 85 2 1 70 101 58 0.995 4.18 5.08 Intr - 130939 130822 118 1 1 69 115 53 0.881 6.14 5.07 Intr - 131410 131283 128 2 2 39 68 58 0.515 -0.70 5.06 Intr - 131546 131519 28 2 1 113 80 35 0.829 2.89 5.05 Intr - 135420 135267 154 2 1 82 75 52 0.745 3.37 5.04 Intr - 140054 139897 158 2 2 90 61 184 0.846 14.71 5.03 Intr - 140466 140364 103 2 1 63 78 60 0.818 2.68 5.02 Intr - 143193 143081 113 0 2 87 90 39 0.807 3.18 5.01 Init - 149852 149646 207 2 0 64 68 243 0.999 18.72 5.00 Prom - 152431 152392 40 -3.56 6.00 Prom + 152779 152818 40 -0.86 6.01 Init + 170195 170333 139 1 1 39 72 89 0.430 2.70 6.02 Intr + 170484 170886 403 1 1 52 49 187 0.012 4.69 6.03 Intr + 194835 195017 183 0 0 56 76 62 0.091 0.70 6.04 Term + 195225 195393 169 0 1 59 43 202 0.442 10.15 6.05 PlyA + 196415 196420 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 170484 170890 407 1 2 52 49 193 0.942 7.15 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:12468870_12682250|GENSCAN_predicted_peptide_1|371_aa KRGQYILALKNQRSRSALPLPIGMILWIESFVISSSAQSARTRIPGAQPTQEVASSLPRG LRPFGIWDKDFYVAPIPDGVLEKCNCQRDRIHVSSSQHVSSLQAHSAESQVQIKMVLGLE QGIRKCSGDSQEGSTEKEESEEASGTRSLGTQRLLARCLPEKTSLQVPRMLQRTQDRSGF RTILQRAAKRARSLRLILRSGRAGAQNTRQPAVRRRTPPSPDGSSAAAHSAGFAAALLDG SGVRPTAGVAGVTASAWGQESPWFLETPAPCSGLVGLGRKVQLTMPERSRSLWWSSVGSI PRGGVAGSCRNYVFDILRNCQTPPVLFDLKMKLGLEKVGCARSHNNNGGFACLEGLLRAK FLRGVQPHEQE >gi568815595f:12468870_12682250|GENSCAN_predicted_CDS_1|1116_bp aaaaggggccagtacatcctggcactgaaaaatcagcgcagccgttcagcactacccctg cctataggaatgattctctggatagaatcattcgtcatttcctcatctgcacagtcagct agaacaagaattcctggagctcaacccacccaggaagtggcctccagcctccctcgtggt ctgagaccatttggcatttgggataaggatttctatgtggcccctatcccagatggtgtc ttagagaaatgtaactgccaaagggaccggatacacgtgtcctccagccaacacgtcagc tccctgcaggcacactccgcagaaagccaagttcaaatcaaaatggtgcttggtttagag caagggattcggaaatgctctggagacagtcaagaaggaagcacagaaaaggaagagtca gaggaagcctctggaacacgcagtctgggcacacagaggcttctggccaggtgtctgcca gagaaaacgtccttgcaagttccccgcatgctgcagcgtacacaggatcgttctggcttc cgtacaatcctgcagagagcagcaaagcgcgcacgcagcctccgcctcatactccgcagt ggcagagctggcgcacagaacacccgccagccggccgtgcgcaggcgcactccgccaagt cccgacgggagctctgcggctgcgcactcggctggctttgcggccgcactgcttgacggc agcggtgtccgccccacggccggcgttgccggggtaacggcgagcgcgtggggccaagaa agcccttggttcttggaaacgccggcgccttgttcagggctggtggggctggggcgcaag gtgcagctgacaatgcccgagaggagccgcagcctctggtggagttcggtcgggtctata cctaggggtggagttgctggatcatgtcgtaactatgtgtttgacattttgaggaactgc cagactcctccagtcctgttcgatttgaagatgaaactgggcttagagaaagtgggttgt gcaaggtcacacaataataatggcggctttgcatgtctggaaggcctactgcgagccaaa ttcctgaggggtgtgcagccacatgagcaagagtaa >gi568815595f:12468870_12682250|GENSCAN_predicted_peptide_2|416_aa MAEAVFHAPKRKRRVYETYESPLPIPFGQDHGPLKEFKIFRAEMINNNVIVRNAEDIEQL YGKGYFGKGILSRSRPSFTISDPKLVAKWKALFGVQHFPHVSYVGLDCRCLESRDQALLL SAVPHFVVPWYQHSVEWAAELMRRQGQDESTVRRILKDYTKPLEHPPVKRNEEAQVHDKL NSGMVSNMEGTAGGERPSVVNGDSGKSGGVGDPREPLGCLQEGSGCHPTTESFEKSVRED ASPLPHVCCCKQDALILQRGLHHEDGSQHIGLLHPGDRGPDHEYVLVEEAECAMSEREAA PNEEEPLTIVKLWKAFTVVQPTFRTTYMAYHYFRSKGWVPKVGLKYGTDLLLYRKGPPFY HASYSVIIELVDDHFEGSLRRPLSWKSLAALSRVSVNVSKCQKDSMTTTYGSKACG >gi568815595f:12468870_12682250|GENSCAN_predicted_CDS_2|1251_bp atggcagaagcagttttccatgccccaaagaggaaaagaagagtgtatgagacttacgag tctccattgccaatcccttttggtcaggaccatggtcctctgaaagaattcaagatattc cgtgctgaaatgattaacaacaatgtgattgtgaggaatgcggaggacattgagcagctc tatgggaaaggttattttggaaaaggtattctttcaagaagccgtccaagcttcacaatt tcagatcctaaactggttgctaaatggaaagccctcttcggcgtgcagcactttcctcat gtgtcatacgtgggtttagactgtcggtgcttggagagcagagatcaggcccttctcctc tctgcagtgccacactttgtggtcccttggtatcagcatagtgttgagtgggcagcagag ctgatgcgtagacaggggcaggatgagagtacagtgcgcagaatcctcaaggattacacg aaaccgcttgagcatcctcctgtgaaaaggaatgaagaggctcaagtgcatgacaagctt aactctggaatggtttccaacatggaaggcacagcagggggagagagaccttctgtggta aacggggactctggaaagtcaggtggtgtgggtgatccccgtgagccattaggctgcctg caggagggctctggctgccacccaacaacagagagctttgagaaaagcgtgcgagaggat gcctcacctctgccccatgtctgttgctgcaaacaagatgctctcatcctccagcgtggc cttcatcatgaagacggcagccagcacatcggcctcctgcatcctggggacagagggcct gaccatgagtacgtgctggtcgaggaagcggagtgtgccatgagcgagagggaggctgcc ccaaatgaggaagagcctttaacgatagtgaagctctggaaagctttcactgtagttcag cccacgttcagaaccacctacatggcctaccattactttcgaagcaagggctgggtgccc aaagtgggactcaagtacgggacagatttactgctatatcggaaaggccctccattttac catgcaagttattctgtcattatcgagctagttgatgaccattttgaaggctctctccgc aggcctctcagttggaagtccctggctgccttgagcagagtttccgttaatgtctctaag tgtcagaaagattccatgacgaccacctatggatccaaggcatgtggctaa >gi568815595f:12468870_12682250|GENSCAN_predicted_peptide_3|104_aa MYGMMEQWDKYLEDFSTSGAWLPHRYEDNHHNCYSYALTFINCVLMAEGRQQLDKGEFTE KYVVPRTRLASKFITLYRAIREHGFYVTDCPQQQAQPPEGGGLC >gi568815595f:12468870_12682250|GENSCAN_predicted_CDS_3|315_bp atgtatggaatgatggagcaatgggacaagtacctggaagacttctccacctcgggggcc tggctgcctcacaggtatgaagacaaccaccataactgctactcttacgcactcacgttc attaactgcgttctgatggcagaaggtagacagcaactggacaagggtgaatttacggag aagtacgtggtcccgcggacaaggctggcatccaagttcatcacactctaccgggcgata cgggagcatggcttctacgtcactgactgtccccagcagcaggcacaaccccctgagggc ggcggtttgtgctga >gi568815595f:12468870_12682250|GENSCAN_predicted_peptide_4|500_aa MGRARAKAEAAAAARGGGTTTVPQPSHHEHQADHLQVSALEPGASGRSPRPQGGRSAAAL DSNRSTNPIVSCACEGSTLCASYENLMPDELSLSPMTPRWDHLVAEKQAQCSTDSTLWYF MHGVCREGSQCLFSHDLANSKPSTICKYYQKGYCAYGTRYDHTRPSAAAGGAVGTMAHSV PSPAFHSPHPPSEVTASIVKTNSHEPGKREKRTLVLRDRNLSGMAERKTQPSMVSNPGSC SDPQPSPEMKPHSYLDAIRSGLDDVEASSSYSNEQQLCPYAAAGECRFGDACVYLHGEIC MLTFEHEMEKAFAFQASQDKVCSICMEVILEKASASERRFGILSNCNHTYCLSCIRQWRC AKQFENPIIKSCPECRVISEFVIPSVYWVEDQNKKNELIEAFKQGMGKKACKYFEQGKGT CPFGSKCLYRHAYPDGRLAEPEKPRKQLSSQGTVRFFNSVRLWDFIENRESRHVPNNEDV DMTELGDLFMHLSGVESSEP >gi568815595f:12468870_12682250|GENSCAN_predicted_CDS_4|1503_bp atgggccgggccagggccaaggccgaggcggcagcggctgcgagaggcggcggcacgacg acggtccctcagcccagccaccatgagcaccaagcagatcacttgcaggtcagtgcgctg gagccaggagcttcgggccgctcccccaggccgcaggggggccgatcagcagcagcatta gattctaatagaagcactaaccctattgtcagctgtgcatgtgagggatctacgttgtgt gcttcttatgagaatctaatgcctgatgaactgtcactgtctcccatgacccccagatgg gaccatctagttgcagaaaaacaagctcagtgttccactgattctacactatggtatttt atgcatggtgtgtgtcgggaaggaagtcagtgcctattctcacatgacttggcaaacagc aaaccgtccaccatctgcaagtactaccagaagggctactgtgcctatggaactcgatat gaccacacgaggccctctgctgcagctggaggtgctgtgggcaccatggcccacagtgtg ccctccccagctttccacagtcctcaccctccttccgaggtcactgcatccattgtgaaa actaactcacatgaacccggaaagcgtgaaaagagaacattggttcttagagaccgaaat ctctctggcatggctgaaaggaagacccagccgagcatggtgagtaatccaggcagctgc agcgacccccagcccagccccgagatgaagccgcattcctacctggatgccatcaggagt ggccttgatgacgtggaggccagcagctcctacagcaacgagcagcagctgtgcccctac gcagctgctggggagtgccggtttggggatgcctgtgtctacctgcacggggagatctgc atgttgacgttcgaacacgagatggaaaaggcctttgccttccaggcaagccaggacaaa gtgtgcagtatctgcatggaagtgatcctggagaaggcctctgcttctgagaggagattt gggattctctccaattgcaatcacacgtactgtttgtcctgcatccggcagtggcggtgt gccaaacagtttgaaaacccaatcattaagtcttgtccagaatgccgtgtgatatcagag tttgtaattccaagtgtgtattgggtggaagatcagaataaaaagaacgagttgattgaa gctttcaaacaggggatggggaaaaaagcctgtaaatactttgagcaaggcaaggggacc tgcccatttggaagcaaatgtctttatcgccatgcttaccccgatgggcggctagcagag cctgagaaacctcggaaacagctcagttctcaaggcactgtgaggttctttaattcagtg cggctctgggatttcatcgagaaccgagaaagccggcatgtccccaacaatgaagatgtc gacatgacagagctcggggacctcttcatgcacctttctggagtggaatcatcagaaccc taa >gi568815595f:12468870_12682250|GENSCAN_predicted_peptide_5|601_aa MEHIQGAWKTISNGFGFKDAVFDGSSCISPTIVQQFGYQRRASDDGKLTDPSKTSNTIRV FLPNKQRTVVNVRNGMSLHDCLMKALKVRGLQPECCAVFRLLHEHKGKKARLDWNTDAAS LIGEELQVDFLDHVPLTTHNFARKTFLKLAFCDICQKFLLNGFRCQTCGYKFHEHCSTKV PTMCVDWSNIRQLFSQHRYSTPHAFTFNTSSPSSEGSLSQRQRSTSTPNVHMVSTTLPVD SRMIEDAIRSHSESASPSALSSSPNNLSPTGWSQPKTPVPAQRERAPVSGTQEKNKIRPR GQRDSSYYWEIEASEVMLSTRIGSGSFGTVYKGKWHGDVAVKILKVVDPTPEQFQAFRNE VAVLRKTRHVNILLFMGYMTKDNLAIVTQWCEGSSLYKHLHVQETKFQMFQLIDIARQTA QGMDYLHAKNIIHRDMKSNSILWLLSSFDCSVLNLGKQKGGFLSQAPEVIRMQDNNPFSF QSDVYSYGIVLYELMTGELPYSHINNRDQIIFMVGRGYASPDLSKLYKNCPKAMKRLVAD CVKKVKEERPLFPQILSSIELLQHSLPKINRSASEPSLHRAAHTEDINACTLTTSPRLPV F >gi568815595f:12468870_12682250|GENSCAN_predicted_CDS_5|1806_bp atggagcacatacagggagcttggaagacgatcagcaatggttttggattcaaagatgcc gtgtttgatggctccagctgcatctctcctacaatagttcagcagtttggctatcagcgc cgggcatcagatgatggcaaactcacagatccttctaagacaagcaacactatccgtgtt ttcttgccgaacaagcaaagaacagtggtcaatgtgcgaaatggaatgagcttgcatgac tgccttatgaaagcactcaaggtgaggggcctgcaaccagagtgctgtgcagtgttcaga cttctccacgaacacaaaggtaaaaaagcacgcttagattggaatactgatgctgcgtct ttgattggagaagaacttcaagtagatttcctggatcatgttcccctcacaacacacaac tttgctcggaagacgttcctgaagcttgccttctgtgacatctgtcagaaattcctgctc aatggatttcgatgtcagacttgtggctacaaatttcatgagcactgtagcaccaaagta cctactatgtgtgtggactggagtaacatcagacaactcttttctcagcacagatattct acacctcacgccttcacctttaacacctccagtccctcatctgaaggttccctctcccag aggcagaggtcgacatccacacctaatgtccacatggtcagcaccaccctgcctgtggac agcaggatgattgaggatgcaattcgaagtcacagcgaatcagcctcaccttcagccctg tccagtagccccaacaatctgagcccaacaggctggtcacagccgaaaacccccgtgcca gcacaaagagagcgggcaccagtatctgggacccaggagaaaaacaaaattaggcctcgt ggacagagagattcaagctattattgggaaatagaagccagtgaagtgatgctgtccact cggattgggtcaggctcttttggaactgtttataagggtaaatggcacggagatgttgca gtaaagatcctaaaggttgtcgacccaaccccagagcaattccaggccttcaggaatgag gtggctgttctgcgcaaaacacggcatgtgaacattctgcttttcatggggtacatgaca aaggacaacctggcaattgtgacccagtggtgcgagggcagcagcctctacaaacacctg catgtccaggagaccaagtttcagatgttccagctaattgacattgcccggcagacggct cagggaatggactatttgcatgcaaagaacatcatccatagagacatgaaatccaacagt atcctttggttgttgagttcatttgactgctcggttctaaatttagggaaacagaaggga ggctttctatcacaagccccagaggtgatccgaatgcaggataacaacccattcagtttc cagtcggatgtctactcctatggcatcgtattgtatgaactgatgacgggggagcttcct tattctcacatcaacaaccgagatcagatcatcttcatggtgggccgaggatatgcctcc ccagatcttagtaagctatataagaactgccccaaagcaatgaagaggctggtagctgac tgtgtgaagaaagtaaaggaagagaggcctctttttccccagatcctgtcttccattgag ctgctccaacactctctaccgaagatcaaccggagcgcttccgagccatccttgcatcgg gcagcccacactgaggatatcaatgcttgcacgctgaccacgtccccgaggctgcctgtc ttctag >gi568815595f:12468870_12682250|GENSCAN_predicted_peptide_6|297_aa MQGWFNIHNPSYKQNQNQNYVIISIDAEKAFNKIQQPFMLKTLNKPVSEVLARAIRQEKE IKGIQLGKEEVKLSLFADDMIVYLENLIVSAQNLLKLTGNFSKVSGYKINVQKSQAFLYT NNRQTESQIMSELSFTIASKRIKYLGIRLTRDVKDLFKENYKPLLNKIKEDTNGRTFHAH GRGAAPSAPPRPGRPGSRSLSPAARLPRHPRPGPATYLREPGRPNVLSFGGSFSPAPPPR GGWRPKRPRLRLSGGPDERALDSVLVLPTCDATGSPILREPYWQSTSNLARASIALA >gi568815595f:12468870_12682250|GENSCAN_predicted_CDS_6|894_bp atgcaaggctggttcaacatacataatccatcatataaacagaaccaaaatcaaaactac gtgattatctcaatagatgcagaaaaggccttcaacaaaattcaacagcccttcatgcta aaaactctcaataaaccagtgtcggaagttctggccagagcaatcaggcaagagaaagaa ataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatg attgtatatttagaaaacctcatcgtctcagcccaaaatctccttaagctgacaggcaac ttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcccaagcattcctatacacc aataacagacaaacagagagccaaatcatgagtgaactctcattcacaattgcttcaaag agaataaaatacctaggaatccgacttacaagggatgtgaaggacctcttcaaggagaac tacaaaccactgctcaacaaaataaaagaggacacaaatggaagaacattccatgctcat ggccgcggggccgctccatcagcgccacccaggcccggtcgccctggaagccgatccctc agccccgccgcccggctcccccggcatccacgacccggtcctgccacctacctgagggag ccaggccgccccaacgtcctgtcgttcggcggcagcttctcgcccgctcctcctccccgc ggcggatggcggcccaagcgcccgcgattaagactctcgggcggcccagacgagcgagcc ctcgactcggtgctcgtcctcccgacctgcgacgccaccggctctccgattctgcgcgag ccctactggcagtcgacttctaacttggctcgggcatccatcgctctggcctga