GENSCAN 1.0 Date run: 7-Nov-116 Time: 21:05:31 Sequence gi568815587r:117740319_117942776 : 202458 bp : 49.55% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 7062 7184 123 2 0 59 97 36 0.472 1.77 1.02 Intr + 8764 8951 188 0 2 100 99 56 0.611 6.39 1.03 Intr + 18423 18489 67 0 1 69 91 6 0.025 -2.09 1.04 Intr + 25565 25660 96 1 0 59 87 46 0.471 1.81 1.05 Intr + 26747 26926 180 1 0 101 51 107 0.603 8.36 1.06 Term + 33000 33179 180 2 0 100 54 86 0.562 4.01 1.07 PlyA + 33288 33293 6 -0.45 2.07 PlyA - 33918 33913 6 1.05 2.06 Term - 35656 35634 23 2 2 105 32 8 0.432 -4.63 2.05 Intr - 36619 36473 147 2 0 89 110 169 0.990 19.41 2.04 Intr - 40492 40175 318 2 0 115 110 602 0.998 61.03 2.03 Intr - 47851 47680 172 1 1 68 60 64 0.032 1.12 2.02 Intr - 56300 56177 124 1 1 91 35 44 0.067 -0.01 2.01 Init - 59508 59438 71 0 2 94 95 41 0.259 5.92 2.00 Prom - 62795 62756 40 -3.36 3.13 PlyA - 63045 63040 6 -0.45 3.12 Term - 64823 64630 194 0 2 35 48 142 0.669 2.48 3.11 Intr - 67438 67334 105 2 0 51 84 52 0.508 1.29 3.10 Intr - 74082 73971 112 0 1 83 30 105 0.120 4.15 3.09 Intr - 78994 78856 139 0 1 69 61 -2 0.052 -4.33 3.08 Intr - 80378 80348 31 0 1 124 85 15 0.196 1.99 3.07 Intr - 82399 82088 312 2 0 100 89 274 0.954 24.86 3.06 Intr - 94858 94762 97 1 1 92 59 52 0.023 2.28 3.05 Intr - 96947 96892 56 0 2 97 89 -18 0.110 -2.10 3.04 Intr - 100050 100001 50 2 2 131 48 94 0.359 8.02 3.03 Intr - 100866 100830 37 1 1 116 107 -7 0.994 1.32 3.02 Intr - 101547 101473 75 1 0 107 97 65 0.998 8.79 3.01 Init - 102458 102401 58 2 1 106 103 141 0.995 16.88 3.00 Prom - 107782 107743 40 -4.66 4.00 Prom + 110931 110970 40 -5.06 4.01 Init + 114296 114436 141 1 0 81 53 70 0.653 2.93 4.02 Term + 116135 116161 27 1 0 106 55 55 0.437 2.07 4.03 PlyA + 119393 119398 6 1.05 5.02 PlyA - 122957 122952 6 1.05 5.01 Sngl - 135458 135276 183 2 0 96 48 167 0.573 6.56 5.00 Prom - 148803 148764 40 -3.76 6.17 PlyA - 148851 148846 6 1.05 6.16 Term - 161546 161325 222 2 0 61 50 144 0.923 4.82 6.15 Intr - 163489 163337 153 1 0 115 94 123 0.995 15.87 6.14 Intr - 163783 163641 143 2 2 28 94 97 0.914 4.37 6.13 Intr - 165418 165320 99 2 0 62 95 26 0.564 0.78 6.12 Intr - 168466 168294 173 0 2 112 93 419 0.998 44.39 6.11 Intr - 169650 169488 163 1 1 89 86 114 0.965 10.33 6.10 Intr - 170432 170389 44 1 2 139 68 -11 0.461 -0.12 6.09 Intr - 171542 171450 93 1 0 106 80 25 0.836 2.58 6.08 Intr - 173588 173459 130 0 1 114 76 119 0.999 13.05 6.07 Intr - 174196 174074 123 2 0 130 91 156 0.996 20.66 6.06 Intr - 176956 176852 105 2 0 115 65 137 0.599 14.29 6.05 Intr - 177503 177397 107 1 2 108 59 -29 0.919 -3.94 6.04 Intr - 178128 178091 38 0 2 51 110 47 0.580 0.26 6.03 Intr - 178520 178219 302 0 2 73 76 229 0.877 16.65 6.02 Intr - 185967 185852 116 2 2 112 89 82 0.351 10.79 6.01 Init - 198867 198761 107 0 2 70 93 43 0.546 2.70 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:117740319_117942776|GENSCAN_predicted_peptide_1|277_aa MHSILVISMAWVTSDQSVNELSLSFPHPSSITHHQHNTERKVEHLHSQRGCRGRKDEAQN YQLLLQPPLPWFPGPNQARRGLSRDEGLETCWKPDAVAELCLPRIEPYRHGHLRLQIHIA KCFPEKLHFITKETPSKPHFFRAKQEQMVAFFQQPRKQMCLGGFSKMMPVNGLYKPESTI QTEGDAGRRAQMEKWPKLSPRAEVKLPWAQILPRHFLRLLLDLWFVKLSPALQIQKSKST QFLWLRKLILKCQREVSLALPSSINLQGNLPCNLPLP >gi568815587r:117740319_117942776|GENSCAN_predicted_CDS_1|834_bp atgcacagcatccttgttatttccatggcctgggtgaccagtgaccagtcagtcaatgag ctctcactgtcctttccccatccctcctcaatcacgcaccaccagcacaatactgaacgc aaggtagaacatcttcattcccagagaggatgcagaggacgaaaggatgaggcccagaac taccagctcctcctccagccccctctgccctggttccctggtcccaaccaggccagacgg ggcttgagccgggacgagggtttggaaacatgctggaaaccagatgctgtggcggagctg tgccttcccaggatagagccttacagacacggacatctgagacttcagatacatattgcc aaatgctttccagaaaagcttcacttcatcacaaaagagactccaagcaagccccacttc ttcagagcaaaacaggagcagatggtggcttttttccagcaaccacggaaacagatgtgt ttgggaggattcagtaagatgatgccggtgaacgggctttataaaccggagagcaccata caaaccgaaggtgatgccggccgcagggcacagatggagaagtggcccaagttatctcca agagctgaagtcaaactgccctgggctcagattctgcctcgtcatttcctcaggctcctg ctggacttgtggtttgtgaagctgagcccagccctgcagatccagaaatcaaagtcaaca cagttcctctggctccggaagctgatcctgaagtgtcagagggaagtatccctggctctt ccatcctccattaaccttcagggcaacttgccatgtaatcttcccctaccatga >gi568815587r:117740319_117942776|GENSCAN_predicted_peptide_2|284_aa MGLVLFNPRYDFPRQMLGMSPFHWPRTRRLIPPLCPCVPLPPRTPALARMLAHRMALIRA LAGGRDHTSSGCLVHALMLSGKLNPLLASVWAQTVSKMQHKGFLVKQQDTVWLVLLWVLN GLARPEDVGTSLYFVNDSLQQVTFSSSVGVVVPCPAAGSPSAALRWYLATGDDIYDVPHI RHVHANGTLQLYPFSPSAFNSFIHDNDYFCTAENAAGKIRSPNIRVKAVFREPYTVRVED QRSMRGNVAVFKCLIPSSVQEYVSVVSWEKDTVSIIPGPLFPML >gi568815587r:117740319_117942776|GENSCAN_predicted_CDS_2|855_bp atggggcttgtcttatttaatcctcgctacgactttccaaggcagatgctggggatgtcc ccattccactggcctcggactcggcggctgattcctccgctatgtccgtgcgtccctctc ccgccgcggaccccagccctggcgcgaatgttggcgcaccgcatggcgcttatccgcgca ttggctggaggcagggaccataccagctcaggctgcctggtacatgctctgatgctgtct gggaaactcaaccctctccttgccagtgtctgggctcagacagtttccaagatgcaacat aaagggtttttagtgaagcagcaagacaccgtctggctggtcctcctctgggttttgaat ggacttgcccgccctgaagatgttggcaccagcctctactttgtaaatgactccttgcag caggtgaccttttccagctccgtgggggtggtggtgccctgcccggccgcgggctccccc agcgcggcccttcgatggtacctggccacaggggacgacatctacgacgtgccgcacatc cggcacgtccacgccaacgggacgctgcagctctaccccttctccccctccgccttcaat agctttatccacgacaatgactacttctgcaccgcggagaacgctgccggcaagatccgg agccccaacatccgcgtcaaagcagttttcagggaaccctacaccgtccgggtggaggat caaaggtcaatgcgtggcaacgtggccgtcttcaagtgcctcatcccctcttcagtgcag gaatatgttagcgttgtatcttgggagaaagacacagtctccatcatcccaggacctttg tttcccatgctgtaa >gi568815587r:117740319_117942776|GENSCAN_predicted_peptide_3|421_aa MELVLVFLCSLLAPMVLASDYQTLRIGGLVFAVVLFSVGILLILSRRCKCSFNQKPRAPG DEEAQVENLITANDGQQRQLPHPLLCLSVVRAAFGKTQAYGRAQAQPVNPVIMAIKNYLV TARPGGSPKGDVDPFYYGKPGPLRTLPEPSGPLPPSSGLSQPQVHALCPLSPLVTTGCCG QAAERDSCWERPPIPLLLPSLSGDYETVRNGGLIFAGLAFIVGLLILLSKSMKMSRNSRG LEGWRDCGILAPPSVWLPHLGLQAFEGGEGRTGRRTKLQDQEMNRLRSQALESADLDLSP GLANYELSFMRIRVKRTTKQALFQLYLILKDNLHGGAPIPPKGFSYSVFGRHLLAMAGLQ AKEASSVEVRGIVVQSDWHVSPGNPTKHWTTVREDQLPTEGVMALDWSCSDLIRIHESIL F >gi568815587r:117740319_117942776|GENSCAN_predicted_CDS_3|1266_bp atggagttggtgctggtcttcctctgcagcctgctggcccccatggtcctggccagtgat taccagaccctgaggattgggggactggtgttcgctgtggtcctcttctcggttgggatc ctccttatcctaagtcgcaggtgcaagtgcagtttcaatcagaagccccgggccccagga gatgaggaagcccaggtggagaacctcatcaccgccaatgatgggcagcagaggcaactc ccgcatcctttgctctgcctgtcagtggtcagagcggcctttggaaagacacaggcttat ggtagagctcaagctcaacctgtcaacccagtgatcatggcgattaaaaactatctggtc acagcaagacctggcggcagccccaagggggacgtggacccgttctactatggtaagcct gggcccctgcgcacccttcctgagccctcaggaccccttccaccaagcagcggcctctcc cagccccaggtccatgctctgtgccccttatctcccctggttaccacgggctgctgcggg caggctgcggagagagacagctgctgggagagaccacccatcccgctcctcttgccctct ctttccggagactatgagaccgttcgcaatgggggcctgatcttcgctggactggccttc atcgtggggctcctcatcctcctcagcaaatcaatgaagatgagccgtaacagcagaggc ctcgaaggctggagggattgtgggattctggcaccacctagcgtctggcttcctcatctg ggcctgcaagcttttgagggaggagaggggagaactggcaggagaaccaagctacaggac caggaaatgaacaggctaaggtcacaggctttggaatcagcagacctggatttgagccct ggcttagccaattatgaactaagtttcatgcgcatccgtgtgaagagaaccaccaaacag gctttgttccagctgtatctaatattgaaagataatctccatggaggagcccccatcccg cccaaaggtttctcatactcagtctttgggcgacacctgctggccatggcaggattacag gccaaggaggccagttcagtagaagttcgtgggatagtggtccagagtgattggcatgtc agcccaggaaatccaacaaaacattggacaacagtaagagaagatcagctacctactgag ggggtgatggcgctggactggagctgctcagacctcatccggattcatgagtctattctg ttttaa >gi568815587r:117740319_117942776|GENSCAN_predicted_peptide_4|55_aa MTRTSQQVQTPCNNSPERQREGEQRAQQGAREVSVEELTFQRNLKSRAQPTDDKL >gi568815587r:117740319_117942776|GENSCAN_predicted_CDS_4|168_bp atgacgagaacttcccaacaggtacagacgccatgcaacaatagtccagaaagacagaga gaaggggagcagagagctcaacagggagctagggaagtttccgtggaggagctgacattc cagcggaaccttaagagcagggctcagcccacggatgacaagctctga >gi568815587r:117740319_117942776|GENSCAN_predicted_peptide_5|60_aa MGLPRHLLQACTLAGPWLLEASPHQCALCHYFLCLCGFVLSGWAKISSFEDGSVGISGPL >gi568815587r:117740319_117942776|GENSCAN_predicted_CDS_5|183_bp atgggtctgccgcgtcacctgctgcaagcctgcacgctcgcgggaccgtggctactggag gcttctccccaccagtgtgctctctgccactacttcctttgtctctgtggcttcgtgctg tctggctgggctaagatttccagctttgaagatggatcagtgggcatctcaggccctctc tga >gi568815587r:117740319_117942776|GENSCAN_predicted_peptide_6|705_aa MRFLTEDEGHEDTHRVRLFLLCFSGLSSIWNMTVTLCVALTSSSKDPGTCPMPLQVCIPH LVLQGPWHLPYAPAECISSKNTFSWSISSPGISSWDTSRPGISSPGISSPGISSWDTSGP GISSPGISSWYTSRPGISRPGISSPGISSPGISSPGISGSGITFQVLIRQVIIRQVSTSN QGHQGEPRIQTVPTTTIHTGLDDTWLWRITRSLGPIPQPCPLQGTSLPKFTWREGQKQLP LIGCVLLLIALVVSLIILFQFWQGHTGIRYKEQRESCPKHAVRCDGVVDCKLKSDELGCV RFDWDKSLLKIYSGSSHQWLPICSSNWNDSYSEKTCQQLGFESAHRTTEVAHRDFANSFS ILRYNSTIQESLHRSECPSQRYISLQCSHCGLRAMTGRIVGGALASDSKWPWQVSLHFGT THICGGTLIDAQWVLTAAHCFFVTREKVLEGWKVYAGTSNLHQLPEAASIAEIIINSNYT DEEDDYDIALMRLSKPLTLSAHIHPACLPMHGQTFSLNETCWITGFGKTRETDDKTSPFL REVQVNLIDFKKCNDYLVYDSYLTPRMMCAGDLRGGRDSCQGDSGGPLVCEQNNRWYLAG VTSWGTGCGQRNKPGVYTKVTEVLPWIYSKMETSPVWADASGPGGAIAVLGWIVGFGGCS FPGPGPSSVKSCSRWLYEHQVLTQTPCWCRGCHQSDCAVANQADN >gi568815587r:117740319_117942776|GENSCAN_predicted_CDS_6|2118_bp atgaggttcctcactgaagacgaaggtcatgaggacacgcacagagtcaggctgttcctg ctctgttttagtggcctgagcagtatttggaacatgacagttacactgtgtgtagccctc acctcgtcctccaaggaccctggcacctgccctatgcctctgcaggtgtgcatccctcac ctcgtcctccaaggaccctggcacctgccctatgcccctgcagaatgcatctccagcaag aacaccttcagctggagcatctccagcccaggcatctccagctgggacacctccaggccg ggcatctccagcccaggcatctccagcccaggcatctccagctgggacacctccgggccg ggcatctccagcccaggcatctccagctggtacacctccaggccgggcatctccaggccg ggcatctccagcccaggcatctccagcccaggcatctccagcccgggcatctccggctct ggcatcactttccaggtcctcatccggcaggtcatcatccgccaggtcagcaccagcaac cagggccaccagggagagcccagaattcagactgtccccaccaccacaatacacacaggc ctggatgatacgtggctttggaggatcactcgaagtttggggcccatcccacagccttgc cctctccagggtacgagcctgcccaagttcacctggcgggagggccagaagcagctaccg ctcatcgggtgcgtgctcctcctcattgccctggtggtttcgctcatcatcctcttccag ttctggcagggccacacagggatcaggtacaaggagcagagggagagctgtcccaagcac gctgttcgctgtgacggggtggtggactgcaagctgaagagtgacgagctgggctgcgtg aggtttgactgggacaagtctctgcttaaaatctactctgggtcctcccatcagtggctt cccatctgtagcagcaactggaatgactcctactcagagaagacctgccagcagctgggt ttcgagagtgctcaccggacaaccgaggttgcccacagggattttgccaacagcttctca atcttgagatacaactccaccatccaggaaagcctccacaggtctgaatgcccttcccag cggtatatctctctccagtgttcccactgcggactgagggccatgaccgggcggatcgtg ggaggggcgctggcctcggatagcaagtggccttggcaagtgagtctgcacttcggcacc acccacatctgtggaggcacgctcattgacgcccagtgggtgctcactgccgcccactgc ttcttcgtgacccgggagaaggtcctggagggctggaaggtgtacgcgggcaccagcaac ctgcaccagttgcctgaggcagcctccattgccgagatcatcatcaacagcaattacacc gatgaggaggacgactatgacatcgccctcatgcggctgtccaagcccctgaccctgtcc gctcacatccaccctgcttgcctccccatgcatggacagacctttagcctcaatgagacc tgctggatcacaggctttggcaagaccagggagacagatgacaagacatcccccttcctc cgggaggtgcaggtcaatctcatcgacttcaagaaatgcaatgactacttggtctatgac agttaccttaccccaaggatgatgtgtgctggggaccttcgtgggggcagagactcctgc cagggagacagcggggggcctcttgtctgtgagcagaacaaccgctggtacctggcaggt gtcaccagctggggcacaggctgtggccagagaaacaaacctggtgtgtacaccaaagtg acagaagttcttccctggatttacagcaagatggagactagcccagtgtgggcagatgcc agcggcccaggtggcgccattgctgtcctgggatggatcgtgggttttggtggatgcagc ttcccagggcctggaccgtcttcggtgaaaagctgctcccgttggctttatgagcatcaa gtcctcacccagaccccctgctggtgccgtggatgtcaccagtcggactgtgctgtggct aaccaggctgacaactga