GENSCAN 1.0 Date run: 6-Nov-116 Time: 02:19:23 Sequence gi568815594f:107795554_108010550 : 214997 bp : 39.30% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3171 3292 122 2 2 18 116 91 0.646 3.27 1.02 Intr + 3476 3579 104 2 2 88 70 53 0.914 2.40 1.03 Term + 4503 4753 251 1 2 56 49 147 0.493 2.48 1.04 PlyA + 10641 10646 6 1.05 2.00 Prom + 11295 11334 40 -2.45 2.01 Init + 12275 12456 182 1 2 47 49 110 0.675 1.70 2.02 Term + 13681 13819 139 1 1 88 50 121 0.433 4.75 2.03 PlyA + 14039 14044 6 1.05 3.05 PlyA - 14243 14238 6 1.05 3.04 Term - 29248 29152 97 0 1 70 48 208 0.911 11.46 3.03 Intr - 41879 41784 96 1 0 141 61 37 0.468 4.51 3.02 Intr - 49790 49686 105 1 0 130 18 45 0.316 0.11 3.01 Init - 52114 51219 896 1 2 36 53 217 0.594 6.84 3.00 Prom - 56480 56441 40 -4.55 4.04 PlyA - 56905 56900 6 1.05 4.03 Term - 62612 62499 114 2 0 86 49 102 0.928 3.69 4.02 Intr - 63168 63076 93 0 0 6 95 120 0.789 3.74 4.01 Init - 67248 67228 21 0 0 85 92 14 0.605 1.29 4.00 Prom - 68200 68161 40 -3.25 5.00 Prom + 84449 84488 40 -5.45 5.01 Init + 100001 100455 455 1 2 85 108 366 0.982 33.68 5.02 Intr + 104022 104139 118 0 1 115 86 12 0.918 3.25 5.03 Intr + 107680 107833 154 0 1 80 101 191 0.988 18.32 5.04 Intr + 113012 113178 167 0 2 76 100 64 0.583 5.16 5.05 Term + 115728 115841 114 2 0 63 42 96 0.521 0.09 5.06 PlyA + 116823 116828 6 1.05 6.00 Prom + 134404 134443 40 -2.95 6.01 Init + 136091 136634 544 1 1 82 96 541 0.610 47.40 6.02 Intr + 149417 150052 636 0 0 114 86 211 0.952 14.64 6.03 Term + 151823 152019 197 0 2 56 49 239 0.898 13.39 6.04 PlyA + 152118 152123 6 1.05 7.00 Prom + 154550 154589 40 -5.55 7.01 Sngl + 162288 162701 414 2 0 16 49 348 0.975 19.74 7.02 PlyA + 163031 163036 6 1.05 8.00 Prom + 163567 163606 40 -6.15 8.01 Sngl + 163660 164310 651 0 0 49 45 293 0.797 17.02 8.02 PlyA + 164371 164376 6 1.05 9.00 Prom + 191670 191709 40 -2.95 9.01 Init + 193966 194511 546 0 0 46 99 367 0.140 28.55 9.02 Intr + 194726 194832 107 1 2 25 34 133 0.001 -0.31 9.03 Intr + 200460 200616 157 0 1 52 89 75 0.001 3.09 9.04 Intr + 214206 214334 129 2 0 74 86 171 0.941 15.47 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 201932 201844 89 2 2 81 89 67 0.874 6.26 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:107795554_108010550|GENSCAN_predicted_peptide_1|158_aa GLEGRPGIEIHTLKRTHVPATEIEDTGMTVTPEHSLPTCGSLDREIFLQLRKDHKLVAFR KDVTSRSLFGQHDVAVGRCDSESPQDFAEHYCGNALTSDFLASPLGTHSLRRAWVVGRTG QRIVPAAVTPSLESGCPCPLADIAVFLSQGRKLQSCGQ >gi568815594f:107795554_108010550|GENSCAN_predicted_CDS_1|477_bp ggcttagagggaaggccgggcattgagatccacacgctgaagcggacacatgtcccagcc acagaaatagaagacactggaatgactgtaacaccagaacacagcctacctacctgcgga agtttggacagagaaattttcctacagctaagaaaagatcataaattggtagcctttagg aaggatgtgaccagcagatccttgtttggacaacacgatgtggctgtgggacgttgtgat tctgagagtccacaggattttgcagagcactactgtggaaatgctcttacttctgacttc ctagcctcccctctgggcactcattccctccgaagggcttgggtggtgggaagaacaggg cagaggattgtgccagcggcagtcactccctcactggagagcggttgcccctgtccctta gcagatatcgctgtcttcctctcacagggcaggaaactgcagtcatgtggacagtag >gi568815594f:107795554_108010550|GENSCAN_predicted_peptide_2|106_aa MGLSRIVIKPEGTEGILESGGHREWNGMEGTKTGGWRCLQKLLLILTERWKGALGQGSDN SLQLRQAQSQGSKDDHGLRWQEEPAEKHGFGVMKITEGSESQSQFS >gi568815594f:107795554_108010550|GENSCAN_predicted_CDS_2|321_bp atgggcttaagcaggatagtgataaaaccagaaggcacagaaggcatcttagagtctggt ggccacagagagtggaatggaatggaggggaccaaaactggaggctggagatgccttcag aagctgctgttgatcctaactgagagatggaagggggcacttgggcagggtagtgacaac agccttcaactaagacaggcccagtcccagggaagcaaagatgatcacggactcagatgg caagaagaacctgctgaaaaacatggatttggggtcatgaagataacagaaggaagtgaa agtcagagccaattcagttga >gi568815594f:107795554_108010550|GENSCAN_predicted_peptide_3|397_aa MQRNTQTLVKNSFPLKFIWNQKTARIAKSILSQKNKAGGITLPDFKLYYKATVTKTAWYW YQNRDIDQWNRTEPSEITPHIYNYLIFDKPEKNKQWGKDSIFNKWCWENWLAISRKLKLD PFLTPYTKINSRWIKDLNVRPKTIKTLEENLGITIQDIGMGKDFMSKTPKAMAIKAKIDK WDLIKLKSFCTAKETTIRVNRQPTKWEKIFAIYSSDKGLISRIYNELKQIYKKKTNKPIK KWAKDMNRHFSKEDIYAAKRHMKKCSSSLAIREMQIKTTMRYHLTPVRMAIIKKSGNNRS RAPLLLLQHMMITLSCFLLPVPNPDCSQSLGEGRLYILFTLLSLSQPLQLSVNLTSSDKP FLTVLGDSPVGPVAAVPVPAAVALLAQSGARKTCATS >gi568815594f:107795554_108010550|GENSCAN_predicted_CDS_3|1194_bp atgcaacgcaacacccaaacattagtcaaaaacagctttcctttaaagttcatatggaac caaaaaacagcccgcatcgccaagtcaatcctaagccaaaagaacaaagctggaggcatc acactacctgacttcaaactatactacaaggctacagtaaccaaaacagcatggtactgg taccaaaacagagatatagatcaatggaacagaacagagccctcagaaataacgccgcat atctacaactatctgatctttgacaaacctgagaaaaacaagcaatggggaaaggattcc atatttaataaatggtgctgggaaaactggctagccataagtagaaagctgaaactggat cccttccttacaccttatacaaaaatcaattcgagatggattaaagacttaaatgttaga cctaaaaccataaaaaccctagaagaaaacctaggcattaccattcaggacataggcatg ggcaaggacttcatgtctaaaacaccaaaagcaatggcaataaaagccaaaattgacaaa tgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaac aggcaacctacaaaatgggagaaaattttcgcaatctactcatctgacaaagggctaata tccagaatctacaatgagctcaaacaaatttacaagaaaaaaacaaacaagcccatcaaa aagtgggcaaaggatatgaacagacacttctcaaaagaagacatttatgcagccaaaaga cacatgaaaaaatgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaatg agataccatctcacaccagttagaatggcaatcattaaaaagtcaggaaacaacagatca cgggctcctttactactcctgcaacatatgatgatcactctttcttgtttcctgcttcca gtaccaaatccagactgttctcagagccttggagaaggaaggttgtacattctgttcact ctgctgagtctttctcagcctttacaactcagtgtaaatctgacctcttctgacaaacct ttcctgactgtcctaggtgactcacccgtcggcccggtggcagcagtgcccgttcctgct gccgtggctctcctggcgcagagtggagctcgaaaaacctgcgcgacttcttaa >gi568815594f:107795554_108010550|GENSCAN_predicted_peptide_4|75_aa MKTTQAQSYDIYECLNQRREDSSSKAAGQLIDSQPSADEHTDKQEDQSMSLKKVSQSQGK ADVVDVEQVVSGKPF >gi568815594f:107795554_108010550|GENSCAN_predicted_CDS_4|228_bp atgaagacaactcaggcacagagttacgacatttatgaatgtcttaaccagagaagggaa gatagcagcagcaaagctgcaggacaactgatagacagccaaccttcagctgatgaacac acagataaacaagaagaccagtcaatgtcactcaaaaaagtcagtcagagtcagggcaag gctgatgtggtcgatgtagagcaagtggtgtcagggaagcctttctaa >gi568815594f:107795554_108010550|GENSCAN_predicted_peptide_5|335_aa MDIIETAKLEEHLENQPSDPTNTYARPAEPVEEENKNGNGKPKSLSSGLRKGTKKYPDYI QIAMPTESRNKFPLEWWKTGIAFIYAVFNLVLTTVMITVVHERVPPKELSPPLPDKFFDY IDRVKWAFSVSEINGIILVGLWITQWLFLRYKSIVGRRFCFIIGTLYLYRCITMYVTTLP VPGMHFQCAPKLNGDSQAKVQRILRLISGGGLSITGSHILCGDFLFSGHTVTLTLTYLFI KEYSPRHFWWYHLICWLLSAAGIICILVAHEHYTIDVIIAYYITTRLFWWYHSMANEKIH RKHEMVSANEWPLSTHLDAVCPVNLDLVPQGSQLF >gi568815594f:107795554_108010550|GENSCAN_predicted_CDS_5|1008_bp atggatatcatagagacagcaaaacttgaagaacatttggaaaatcaacccagtgatcct acgaacacttatgcaagacccgctgaacctgttgaagaagaaaacaaaaatggcaatggt aaacccaagagcttatccagtgggctgcgaaaaggcaccaaaaagtacccggactatatc caaattgctatgcccactgaatcaaggaacaaatttccactagagtggtggaaaacgggc attgccttcatatatgcagttttcaacctcgtcttgacaaccgtcatgatcacagttgta catgagagggtccctcccaaggagcttagccctccactcccagacaagttttttgattac attgatagggtgaaatgggcattttctgtatcagaaataaatgggattatattagttgga ttatggatcacccagtggctgtttctgagatacaagtcaatagtgggacgcagattctgt tttattattggaactttatacctgtatcgctgcattacaatgtatgttactactctacct gtgcctggaatgcatttccagtgtgctccaaagctcaatggagactctcaggcaaaagtt caacggattctacgattgatttctggtggtggattgtccataactggatcacatatctta tgtggagacttcctcttcagcggtcacacggttacgctgacactgacttatttgttcatc aaagaatattcgcctcgtcacttctggtggtatcatttaatctgctggctgctgagtgct gccgggatcatctgcattcttgtagcacacgaacactacactatcgatgtgatcattgct tattatatcacaacacgactgttttggtggtaccattcaatggccaatgaaaagatacac aggaaacatgagatggtttctgctaatgagtggcccttgagtacacacttagatgctgtc tgccctgtaaatttggatctggtgccccagggcagtcaactcttctag >gi568815594f:107795554_108010550|GENSCAN_predicted_peptide_6|458_aa MSSPGPSQPPAEDPPWPARLLRAPLGLLRLDPSGGALLLCGLVALLGWSWLRRRRARGIP PGPTPWPLVGNFGHVLLPPFLRRRSWLSSRTRAAGIDPSVIGPQVLLAHLARVYGSIFSF FIGHYLVVVLSDFHSVREALVQQAEVFSDRPRVPLISIVTKEKGEREVVGCGYADAADES PGVVFAHYGPVWRQQRKFSHSTLRHFGLGKLSLEPKIIEEFKYVKAEMQKHGEDPFCPFS IISNAVSNIICSLCFGQRFDYTNSEFKKMLGFMSRGLEICLNSQVLLVNICPWLYYLPFG PFKELRQIEKDITSFLKKIIKDHQESLDRENPQDFIDMYLLHMEEERKNNSNSSFDEEYL FYIIGDLFIAGTDTTTNSLLWCLLYMSLNPDVQEKVHEEIERVIGANRAPSLTDKAQMPY TEATIMEVQRLTVVVPLAIPHMTSENTGKSRVFLFECP >gi568815594f:107795554_108010550|GENSCAN_predicted_CDS_6|1377_bp atgtcgtctccggggccgtcgcagccgccggccgaggacccgccctggcccgcgcgcctc ctgcgtgcgcctctggggctgctgcggctggaccccagcgggggcgcgctgctgctatgc ggcctcgtagcgctgctgggctggagctggctgcggaggcgccgggcgcggggcatcccg cccgggcccacgccctggcctctggtgggcaacttcggtcacgtgctgctgcctcccttc ctccggcggcggagctggctgagcagcaggaccagggccgcagggattgatccctcggtc ataggcccgcaggtgctcctggctcacctagcccgcgtgtacggcagcatcttcagcttc tttatcggccactacctggtggtggtcctcagcgacttccacagcgtgcgcgaggcgctg gtgcagcaggccgaggtcttcagcgaccgcccgcgggtgccgctcatctccatcgtgacc aaggagaagggtgagcgggaggtcgtgggctgtgggtacgcggatgccgcggatgagtct ccaggggttgtgtttgcacattatggtcccgtctggagacaacaaaggaagttctctcat tcaactcttcgtcattttgggttgggaaaacttagcttggagcccaagattattgaggag ttcaaatatgtgaaagcagaaatgcaaaagcacggagaagaccccttctgccctttctcc atcatcagcaatgccgtctctaacatcatttgctccttgtgctttggccagcgctttgat tacactaatagtgagttcaagaaaatgcttggttttatgtcacgaggcctagaaatctgt ctgaacagtcaagtcctcctggtcaacatatgcccttggctttattaccttccctttgga ccatttaaggaattaagacaaattgaaaaggatataaccagtttccttaaaaaaatcatc aaagaccatcaagagtctctggatagagagaaccctcaggacttcatagacatgtacctt ctccacatggaagaggagaggaaaaataatagtaacagcagttttgatgaagagtactta ttttatatcattggggatctctttattgctgggactgataccacaactaactctttgctc tggtgcctgctgtatatgtcgctgaaccccgatgtacaagaaaaggttcatgaagaaatt gaaagagtcattggcgccaaccgagctccttccctcacagacaaggcccagatgccctac acagaagccaccatcatggaagtgcagaggctaactgtggtggtgccgcttgccattcct catatgacctcagagaacacaggcaagtccagggtcttcctctttgaatgcccttga >gi568815594f:107795554_108010550|GENSCAN_predicted_peptide_7|137_aa MIGNNKLLQAKGNVRTHCKEAKNLEKILHEWLTRINSVEKTLNDLMELKTMAQELRDACT SFSSRFDQLEERVSVIEEQMNEMKREEKFREKSVKRNKQSLQEIWDEVKRPNLRLTGVPE SDGENGTKLENTLQDII >gi568815594f:107795554_108010550|GENSCAN_predicted_CDS_7|414_bp atgattggtaataacaaacttctccaagctaaagggaatgttcgaacccattgcaaagaa gctaaaaaccttgaaaaaatattacatgaatggctaactagaataaacagtgtagagaag accttaaatgacctgatggagctaaaaaccatggcacaagaactacgtgacgcatgtaca agcttcagtagccgattcgatcaactggaagaaagggtatcagtgattgaagagcaaatg aatgaaatgaagcgagaagagaagtttagagaaaaaagtgtaaaaagaaacaaacaaagc ctccaagaaatatgggacgaggtaaaaagaccaaatctacgtctgactggtgtacctgaa agtgacggggagaatggaaccaagttggaaaacactctgcaggatattatctag >gi568815594f:107795554_108010550|GENSCAN_predicted_peptide_8|216_aa MGDFNTPLSTLDRSTRQEVNKDIQELNSALHQVDLIDIYRTLQPKSTEYTFFSVPHCTYS KIDHVVGSKALLSKCKRTEIITNCLSDHSAIKLELRIKKLTQNRSTTWKLNNLLLNDDWV HNKMKAEIKMFLETNENKDTTYQNLWDTFKAVCKGKFIALNAHKRKQERSKLDTLTSQLK ELAKQEQTHSKASRRQEITDQTRTEGDTGTKIPSKN >gi568815594f:107795554_108010550|GENSCAN_predicted_CDS_8|651_bp atgggagactttaacaccccactgtcaacattagacagatcaacgagacaggaagttaac aaggatatccaggaattgaactcagctctgcaccaagtggacctaatagacatctataga actctccaacccaagtcaacagagtatacattcttctcagtaccacattgcacttattcc aaaattgaccacgtagttggaagtaaagcactcctcagcaaatgtaaaagaacagaaatt ataacaaactgtctctcagaccacagtgcaatcaaactagaactcaggattaagaaacta actcaaaaccgctcaactacatggaaactgaacaacctgctcctgaatgacgactgggta cataacaaaatgaaggcagaaataaagatgttccttgaaaccaatgagaacaaagacaca acataccagaatctttgggacacatttaaagcagtgtgtaaagggaaatttatagcacta aatgcccacaagagaaagcaggaaagatctaaacttgacaccctaacatcacaattaaaa gaactagcgaagcaagagcaaacacattcaaaagctagcagaaggcaagaaataacagat cagaccagaactgaaggagatacaggcacaaaaatcccttcaaaaaattga >gi568815594f:107795554_108010550|GENSCAN_predicted_peptide_9|313_aa MHMSSKDSILQSQHHSLHRTRGQWLFRRRVCSRPHTRTPAARSRCSAGLKPALTRAPAAD SCEPARVYPLNAGTLQPGPMGRAGLEAPPPPCGVTGTPGARGLQGRVGPRPQSLAFRGCP PRASSLPGSPRCRRRCHTMAFVTRQFMRSVSSSSTASASAKKIIVKHVTVIGGGLMGAGI AQLPRLENSGCPQGPLKVPWRVAVNAFWKRAVQEKKELELAPPVAAVVLPATCTLKMGSQ SVSPYQKRLCNFLVMGELFPLPSPLQFLDQVAAATGHTVVLVDQTEDILAKSKKGIEESL RKVAKKKFAENLK >gi568815594f:107795554_108010550|GENSCAN_predicted_CDS_9|939_bp atgcacatgtcatcaaaggatagcattttacagagtcaacaccactcacttcatagaaca aggggccagtggcttttcagaagaagggtctgttctcggccccacaccagaacgccagcc gccaggtcccgctgctcagcaggtctcaagcccgcgctcactcgggctccggctgctgac agctgcgagcccgcgcgtgtatacccgctcaacgctgggacgttacagccagggccaatg ggcagagcgggactcgaggccccgcccccgccttgtggcgtcacggggacgccgggggcg cgcgggctgcagggccgcgtaggtccccgcccccagagtctggctttccgcggctgcccg cctcgcgcgtcttccctgcccgggtctcctcgctgtcgccgccgctgccacaccatggcc ttcgtcaccaggcagttcatgcgttccgtgtcctcctcgtccaccgcctcggcctcggcc aagaagataatcgtcaagcacgtgacggtcatcggcggcgggctgatgggcgccggcatt gcccagcttcctcgtttggaaaattctggctgcccccagggcccgctgaaggttccatgg agagtggcggtgaatgcattttggaagcgggccgtgcaagagaagaaagaattagagctt gcccctcctgttgctgcagttgttcttccagccacatgtactctgaagatgggatcacaa tcagtgtctccataccagaaaaggctctgtaactttttggtcatgggagagctctttcct ctgccctctcctttgcagttcctagaccaggttgctgcagcaactggtcacacagtagtg ttggtagaccagacagaggacatcctggcaaaatccaaaaagggaattgaggaaagcctt aggaaagtggcaaagaagaagtttgcagaaaaccttaag