GENSCAN 1.0    Date run: 7-Nov-116    Time: 00:16:28

Sequence gi568815591r:73734746_73942140 : 207395 bp : 50.25% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    662    754   93  2  0   42   80   119 0.660   5.58
 1.02 Intr +    983   1103  121  2  1  112   75   103 0.964  11.90
 1.03 Term +   1185   1328  144  2  0  137   47   139 0.999  12.61
 1.04 PlyA +   1371   1376    6                              -1.95

 2.07 PlyA -   1510   1505    6                              -0.45
 2.06 Term -   1946   1814  133  1  1   21   48   197 0.924   6.56
 2.05 Intr -   2365   2184  182  1  2   69   66   127 0.906   7.27
 2.04 Intr -   2646   2476  171  0  0   81   59   250 0.995  21.54
 2.03 Intr -   3053   2817  237  2  0   56   84   318 0.398  26.01
 2.02 Intr -   3697   3583  115  0  1   97   84   184 0.534  19.35
 2.01 Init -   4025   3901  125  2  2   80   78   135 0.978   9.35
 2.00 Prom -  23842  23803   40                              -4.76

 3.02 PlyA -  29994  29989    6                               1.05
 3.01 Sngl -  35304  34642  663  0  0   91   49  1454 0.967 135.98
 3.00 Prom -  63910  63871   40                              -1.06

 4.00 Prom +  75298  75337   40                              -5.66
 4.01 Sngl +  96457  97086  630  0  0   80   45  1252 0.655 114.29
 4.02 PlyA +  97917  97922    6                               1.05

 5.07 PlyA -  97952  97947    6                               1.05
 5.06 Term - 100257  99966  292  2  1  144   41   263 0.965  22.02
 5.05 Intr - 104827 104740   88  2  1   62   89   -14 0.368  -4.87
 5.04 Intr - 105375 105286   90  1  0   38   94   145 0.998  10.07
 5.03 Intr - 105804 105669  136  0  1   80   75    59 0.992   3.94
 5.02 Intr - 106453 106325  129  1  0   53   78   111 0.994   7.49
 5.01 Init - 107395 107273  123  1  0   86   97   184 0.999  17.34
 5.00 Prom - 116117 116078   40                              -4.76

 6.02 PlyA - 116992 116987    6                               1.05
 6.01 Sngl - 126371 126195  177  2  0   90   49   206 0.964   9.86
 6.00 Prom - 127681 127642   40                              -4.26

 7.00 Prom + 127774 127813   40                              -7.96
 7.01 Init + 130479 130676  198  2  0   60   98   165 0.651  11.61
 7.02 Intr + 130832 131020  189  1  0  113   -1   163 0.450   9.68
 7.03 Intr + 141517 141630  114  0  0   78   85    65 0.842   5.84
 7.04 Intr + 142179 142322  144  2  0   21   41   130 0.642   2.08
 7.05 Intr + 145801 145861   61  0  1  112   24    62 0.439   0.51
 7.06 Intr + 151908 152013  106  1  1  118   68    82 0.364   8.47
 7.07 Intr + 156735 156926  192  0  0   94    9   138 0.342   5.11
 7.08 Intr + 165188 165317  130  2  1   74   81    53 0.218   3.90
 7.09 Intr + 166072 166220  149  0  2  126   58   -25 0.072  -2.67
 7.10 Intr + 167204 167480  277  2  1  137   75    37 0.104   4.82
 7.11 Intr + 170861 170996  136  1  1  109  -30    85 0.035  -1.06
 7.12 Intr + 174947 174984   38  0  2  114   66    27 0.058   1.08
 7.13 Term + 177481 177669  189  0  0   66   49   122 0.240   3.55
 7.14 PlyA + 179096 179101    6                              -0.45

 8.03 PlyA - 179121 179116    6                               1.05
 8.02 Term - 182233 182098  136  0  1   74   48   145 0.973   6.59
 8.01 Init - 183062 182935  128  2  2   28   82   113 0.921   4.33
 8.00 Prom - 189459 189420   40                              -1.96

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591r:73734746_73942140|GENSCAN_predicted_peptide_1|119_aa
XPEYQAAGALGDTRTEPSLDQVLQERDEAIAKKQAVEAELQRCKARLHAMEAQLLEVLEE
KLRLRRELEAWEEDVQQLVWQQVQNQLQREAKGTRGAHVDPGAASTPRSRFSLGRGRWW

>gi568815591r:73734746_73942140|GENSCAN_predicted_CDS_1|360_bp
nncccagagtaccaggcggctggagcactgggggacacccggacagaaccctccctggac
caagtcctccaggaacgggatgaagccattgccaagaagcaggcggtggaggcggagctg
cagagatgcaaagccaggctacacgccatggaggcccagctgctggaggtcctggaggag
aaactgaggctgaggcgggagctggaggcctgggaggaggacgtgcagcagctggtgtgg
cagcaggtccagaatcagctgcagagagaggccaagggtactcggggagcccacgtggac
cctggagctgccagcaccccccgatccagattctccctgggtcggggacgttggtggtga

>gi568815591r:73734746_73942140|GENSCAN_predicted_peptide_2|320_aa
MLRWTRAWRLPREGLGPHGPSFARVPVAPSSSSGGRGGAEPRLLDGEAALPAVVFLHGLF
GSKTNFNSIAKILAQQTGRRPLAVYPQNGDINAAPSHPTPQVLTVDARNHGDSPHSPDMS
YEIMSQDLQDLLPQLGLVPCVVVGHSMGGKTAMLLALQRPELVERLIAVDISPVESTGVS
HFATYVAAMRAINIADELPRSRARKLADEQLSSVIQDMAVRQHLLTNLVEVDGRFVWRVN
LDALTQHLDKILAFPQRQESYLGPTLFLLGGNSQFVHPSHHPEIMRLFPRAQMQTVPNAG
HWIHADRPQDFIAAIRGFLV

>gi568815591r:73734746_73942140|GENSCAN_predicted_CDS_2|963_bp
atgctccgctggacccgagcctggaggctcccgcgtgagggactcggcccccacggccct
agcttcgcgagggtgcctgtcgcacccagcagcagcagcggcggccgagggggcgccgag
ccgaggcttctggacggggaggcagccctcccggccgtcgtctttttgcacgggctcttc
ggcagcaaaactaacttcaactccatcgccaagatcttggcccagcagacaggccgtagg
cctttggctgtctacccccagaatggggatatcaacgcagccccgtctcaccccactccc
caggtgctgacggtggatgctcgtaaccacggtgacagcccccacagcccagacatgagc
tacgagatcatgagccaggacctgcaggaccttctgccccagctgggcctggtgccctgc
gtcgtcgttggccacagcatgggaggaaagacagccatgctgctggcactacagaggcca
gagctggtggaacgtctcattgctgtagatatcagcccagtggaaagcacaggtgtctcc
cactttgcaacctatgtggcagccatgagggccatcaacatcgcagatgagctgccccgc
tcccgtgcccgaaaactggcggatgaacagctcagttctgtcatccaggacatggccgtg
cggcagcacctgctcactaacctggtagaggtagacgggcgcttcgtgtggagggtgaac
ttggatgccctgacccagcacctagacaagatcttggctttcccacagaggcaggagtcc
tacctcgggccaacactctttctccttggtggaaactcccagttcgtgcatcccagccac
caccctgagattatgcggctcttccctcgggcccagatgcagacggtgccgaacgctggc
cactggatccacgctgaccgcccacaggacttcatagctgccatccgaggcttcctggtc
taa

>gi568815591r:73734746_73942140|GENSCAN_predicted_peptide_3|220_aa
MSMGLEITGTALAVLGWLGTIVCCALPMWRVSAFIGSNIITSQNIWEGLWMNCVVQSTGQ
MQCKVYDSLLALPQDLQAARALIVVAILLAAFGLLVALVGAQCTNCVQDDTAKAKITIVA
GVLFLLAALLTLVPVSWSANTIIRDFYNPVVPEAQKREMGAGLYVGWAAAALQLLGGALL
CCSCPPREKKYTATKVVYSAPRSTGPGASLGTGYDRKDYV

>gi568815591r:73734746_73942140|GENSCAN_predicted_CDS_3|663_bp
atgtccatgggcctggagatcacgggcaccgcgctggccgtgctgggctggctgggcacc
atcgtgtgctgcgcgttgcccatgtggcgcgtgtcggccttcatcggcagcaacatcatc
acgtcgcagaacatctgggagggcctgtggatgaactgcgtggtgcagagcaccggccag
atgcagtgcaaggtgtacgactcgctgctggcactgccacaggaccttcaggcggcccgc
gccctcatcgtggtggccatcctgctggccgccttcgggctgctagtggcgctggtgggc
gcccagtgcaccaactgcgtgcaggacgacacggccaaggccaagatcaccatcgtggca
ggcgtgctgttccttctcgccgccctgctcaccctcgtgccggtgtcctggtcggccaac
accattatccgggacttctacaaccccgtggtgcccgaggcgcagaagcgcgagatgggc
gcgggcctgtacgtgggctgggcggccgcggcgctgcagctgctggggggcgcgctgctc
tgctgctcgtgtcccccacgcgagaagaagtacacggccaccaaggtcgtctactccgcg
ccgcgctccaccggcccgggagccagcctgggcacaggctacgaccgcaaggactacgtc
taa

>gi568815591r:73734746_73942140|GENSCAN_predicted_peptide_4|209_aa
MASMGLQVMGIALAVLGWLAVMLCCALPMWRVTAFIGSNIVTSQTIWEGLWMNCVVQSTG
QMQCKVYDSLLALPQDLQAARALVIISIIVAALGVLLSVVGGKCTNCLEDESAKAKTMIV
AGVVFLLAGLMVIVPVSWTAHNIIQDFYNPLVASGQKREMGASLYVGWAASGLLLLGGGL
LCCNCPPRTDKPYSAKYSAARSAAASNYV

>gi568815591r:73734746_73942140|GENSCAN_predicted_CDS_4|630_bp
atggcctccatggggctacaggtaatgggcatcgcgctggccgtcctgggctggctggcc
gtcatgctgtgctgcgcgctgcccatgtggcgcgtgacggccttcatcggcagcaacatt
gtcacctcgcagaccatctgggagggcctatggatgaactgcgtggtgcagagcaccggc
cagatgcagtgcaaggtgtacgactcgctgctggcactgccgcaggacctgcaggcggcc
cgcgccctcgtcatcatcagcatcatcgtggctgctctgggcgtgctgctgtccgtggtg
gggggcaagtgtaccaactgcctggaggatgaaagcgccaaggccaagaccatgatcgtg
gcgggcgtggtgttcctgttggccggccttatggtgatagtgccggtgtcctggacggcc
cacaacatcatccaagacttctacaatccgctggtggcctccgggcagaagcgggagatg
ggtgcctcgctctacgtcggctgggccgcctccggcctgctgctccttggcggggggctg
ctttgctgcaactgtccaccccgcacagacaagccttactccgccaagtattctgctgcc
cgctctgctgctgccagcaactacgtgtaa

>gi568815591r:73734746_73942140|GENSCAN_predicted_peptide_5|285_aa
MAQEEGGSLPEVRARVRAAHGIPDLAQKLHFYDRWAPDYDQDVATLLYRAPRLAVDCLTQ
ALPGPPHSALILDVACGTGLVAAELRAPGFLQLHGVDGSPGMLEQAQAPGLYQRLSLCTL
GQEPLPSPEGTFDAVLIVGALSDGQVPCNAIPELHVTKPVFSSGKWGCSGEMRAVSTVCK
LCTLRCPGRWAGVSDHQDQLVQPSIQGGSGGHPGQAGAGWDVGRPGGLACGPPVDRWELA
TSELEVVSGISAKDGFISGIVYLYRKWKATQVEEVRSSPQPPAGP

>gi568815591r:73734746_73942140|GENSCAN_predicted_CDS_5|858_bp
atggcccaggaggagggtgggagcctgcccgaggtgcgggcgcgggtcagggccgcgcat
ggcatccccgacctggcccaaaagctccatttctatgaccgctgggctccggactacgac
caggatgtggccaccctgctgtaccgtgcgccccgcctcgcagtggactgcctcacacaa
gcccttccaggcccgccccacagtgccctgatcctggacgtggcctgtggcacaggccta
gtggctgccgagctgcgggctccaggcttcctccagctgcatggggtggatgggagccca
gggatgctggaacaggcccaggcccccggcctctatcagcgcctcagcctctgcaccctg
ggccaggagcctctgcccagcccggaagggaccttcgacgcggtgctgatagtcggtgcc
ctcagtgacggccaggtgccctgcaatgcgatacctgagctacatgtcaccaagccagtc
ttctcatctgggaaatggggctgctcaggggagatgagagctgtgagcaccgtttgtaaa
ctttgcacattgcgctgcccgggcaggtgggctggtgtgtctgaccaccaggaccaactc
gtccaaccttcaatacaaggaggctctggaggccaccctggacaggctggagcaggctgg
gatgtgggaaggcctggtggcctggcctgtggaccgcctgtggaccgctgggagctggct
acctccgagctggaggtggtatccggcatctctgccaaggatggcttcatctccggcatt
gtctacctgtaccgaaagtggaaggcgacccaggttgaggaagtgagatccagcccccag
cccccagctggcccctga

>gi568815591r:73734746_73942140|GENSCAN_predicted_peptide_6|58_aa
MGTRLPPVTHCLYLAPAAPLLPLKPQSTSEPNTAKEVLEQSFLRVGFPQILEQESSAA

>gi568815591r:73734746_73942140|GENSCAN_predicted_CDS_6|177_bp
atggggacccggttgccgccagtgacccactgcctttacctggctcctgccgctccactg
cttcctctgaaacctcagtccacgtctgagcccaacacggccaaggaggtcttagagcaa
agcttccttcgggtcggatttccccagatcttggaacaggaatcaagtgccgcctag

>gi568815591r:73734746_73942140|GENSCAN_predicted_peptide_7|640_aa
MWGSTKGLGLALLSAWEQLGLSVAIWTDLFLSCLHGLMLVALLLVVVTWRVCQKSHCFRL
GRQLSKALQVNCVVRKLLVQLRRLYWWVETMTALTSWHLAYLITWTTCLASHLLQAAFEH
TTQLAEAQELSDLLEDLQRQNEQKAPWGRWHHPWTILMDLPISSHPQGEFQLGTKQSRLP
SDSIGVSLWSCLIEPLIRTSAKRAEKKLPAFPKLRPAAGNGKLTCPKGRCTGMPGAPIFA
GSPHSLPLRRPILSAGPQTAHSFMLISSRWLCLTWGSEPFFCPPGDSQSGMDGREGLHFC
LLSGAYSASAETSWLLLVSDSTIICLDRMDDAFPRAVKVLKISKFLPTLPFQDISPAQAL
QQLYPPAAPSHVGTRSQEGECQSRDPHPGSPSPAYTASFEESESCFVLQAGVCWYDHSSL
QPQPPGLSASLALSPSAPGPGAVAFSQDLTWSHALSGETSSCSPSHGMPGMQPYICRPPS
RRKRSSGWWTFKDSSESKLRPGRAVLRIWHHHRGAWLGAQAHQLPHGDSRVSCFPIPKTP
TASCSSPFCIAHHQGDLHHVWACSCRGPVALEKIPSWEPLYKSSPKMNLEERDYSSEQFA
NPEDTAFSVKEDVHSREPREGSGFTAKVPAQVPNRVRLCK

>gi568815591r:73734746_73942140|GENSCAN_predicted_CDS_7|1923_bp
atgtggggcagcaccaagggcctgggcctggccttgctcagtgcctgggagcagctgggc
ctgtctgtggccatctggacagatctgtttttgtcatgtctgcacggcctgatgttggtg
gccttgctcttggtggtagtgacctggagggtgtgtcagaagtcccactgcttccgactg
ggcaggcagctcagtaaggccttgcaagtgaactgcgtggtaaggaagctcctggtacag
ctgagacgtctgtattggtgggtggagactatgactgccctcacctcctggcacctggcc
tatctcatcacctggaccacctgcctggcctcccacctgctgcaggctgcctttgagcac
acgacccagctggccgaggcccaggagctgtctgacctgctagaggacctacagagacag
aatgagcagaaagcaccctggggccgttggcatcatccttggaccatcctgatggacctt
cccatctcatcccatccgcagggggagtttcagctggggaccaagcagagccgcctgccc
tccgattccattggtgtgtctctgtggagctgcctcatcgagcccctaattcgcacctcg
gccaagagggccgagaagaaacttcccgcattccccaaactccggcccgctgctggaaat
gggaagctgacatgtcccaaaggtcgatgcactggaatgcccggagctccgatcttcgcc
ggctctcctcactcgctccctctccggcggcccatcctgtccgctggcccccagacagca
cattccttcatgctgatctcttcccggtggctatgcctaacctggggatcagagcccttc
ttttgccccccaggggactcccagtctgggatggacgggagagaaggactccacttctgc
ctcctgtcaggagcctactcagcttctgctgaaacctcctggctgctgctggtgtcggat
tccaccatcatctgcctggaccgcatggatgacgccttccctagggctgtcaaggttcta
aaaatttccaaattcctccccaccctgcccttccaggatatttctccagctcaggccctt
cagcagctctacccaccagctgctccttctcacgtgggcacacggagtcaggaaggagag
tgtcaaagccgggacccccatcctggcagtccctcccccgcatacactgcctctttcgaa
gagtcagagtcttgctttgtcctccaggctggagtgtgctggtacgatcatagctccctg
cagcctcaacctcctgggctcagtgccagcttggccctcagtccctctgctccaggtcct
ggggctgtggccttttcccaggatctgacctggagtcatgcactttctggggagacctcc
agttgctcccccagccatggcatgccgggaatgcagccctacatatgcaggcccccttca
aggcggaagaggagctcgggctggtggactttcaaggactcatcagagagcaagctccgg
cccggtcgggctgtcctccggatctggcaccatcataggggggcttggctgggcgcccag
gcacatcagctgccccatggagactcccgcgtatcctgcttccccatccccaaaaccccc
acggccagctgttcttcccctttctgcatcgcccatcaccaaggcgacctgcaccatgtc
tgggcctgctcctgccgagggcctgtggctctagaaaagatccccagctgggagcctttg
tacaaaagttctccgaagatgaatttggaagaaagagactattccagtgaacagtttgca
aacccagaagacacagccttcagtgtcaaagaagatgtgcattccagagaaccaagggag
ggctcaggttttacagcaaaagttcccgcccaggttcccaatcgtgttcgtttatgcaaa
tga

>gi568815591r:73734746_73942140|GENSCAN_predicted_peptide_8|87_aa
MIEGDPVSKNNNDDDEDENICGGVRDRYEWFTCMNVFDDPHGRPPVCDPSGGGPSPGSTY
GMEVLIEGPHHKSPELPNTQKLAKWRG

>gi568815591r:73734746_73942140|GENSCAN_predicted_CDS_8|264_bp
atgatagagggagaccctgtctctaaaaataataatgatgatgatgaagatgaaaacatt
tgtggaggtgtcagggaccgttatgaatggttcacttgtatgaatgtgtttgatgaccca
catggtaggccccccgtttgtgacccatctggaggtggccccagccccggcagtacttac
gggatggaagtgttgatcgagggcccccaccacaagtcacctgagctgccaaacacgcag
aagctggccaagtggcgaggctga