GENSCAN 1.0 Date run: 6-Nov-116 Time: 06:40:49

Sequence gi568815597r:205878854_206094135 : 215282 bp : 45.98% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   8801   8840   40  1  1   68  119    54 0.114   6.86
 1.02 Term +  21511  21911  401  2  2   -9   54   259 0.166   8.08
 1.03 PlyA +  21999  22004    6                               1.05

 2.00 Prom +  23086  23125   40                              -4.96
 2.01 Init +  23877  24609  733  2  1   26   40   296 0.391  13.54
 2.02 Intr +  29031  29224  194  1  2   53   77   113 0.568   5.91
 2.03 Term +  32836  32847   12  0  0  130   44     6 0.123  -1.40
 2.04 PlyA +  33304  33309    6                               1.05

 3.22 PlyA -  34222  34217    6                               1.05
 3.21 Term -  36330  36039  292  2  1  118   50   199 0.910  13.92
 3.20 Intr -  36551  36508   44  2  2  112   48    23 0.970  -2.26
 3.19 Intr -  38501  38430   72  2  0  109   99   126 0.944  15.40
 3.18 Intr -  40132  39987  146  2  2   98   74   124 0.999  12.00
 3.17 Intr -  41377  41323   55  1  1   49   95   121 0.999   7.45
 3.16 Intr -  42994  42713  282  1  0  109  100   439 0.948  44.82
 3.15 Intr -  44342  44229  114  2  0   77   63    43 0.717   1.34
 3.14 Intr -  44574  44482   93  0  0   61  116   116 0.978  11.86
 3.13 Intr -  44760  44691   70  2  1   69   73    80 0.524   3.78
 3.12 Intr -  45636  45530  107  0  2   99   81    72 0.973   6.61
 3.11 Intr -  47777  47682   96  2  0   34   75   128 0.923   6.31
 3.10 Intr -  48752  48639  114  2  0  109   99    43 0.889   8.14
 3.09 Intr -  49196  49049  148  1  1   98   70   239 0.911  23.34
 3.08 Intr -  50056  49974   83  1  2   73  100    86 0.999   6.74
 3.07 Intr -  50503  50351  153  1  0   89   79   323 0.994  31.77
 3.06 Intr -  51203  51039  165  2  0  112   59   402 0.999  39.86
 3.05 Intr -  53182  53007  176  2  2   44   94   312 0.994  27.06
 3.04 Intr -  53959  53849  111  2  0  111  110    64 0.997  11.25
 3.03 Intr -  54231  54092  140  2  2   81   67   162 0.999  13.51
 3.02 Intr -  56969  56843  127  0  1   56   93   227 0.897  19.74
 3.01 Init -  57261  57201   61  0  1   78   96    61 0.886   7.31
 3.00 Prom -  58736  58697   40                              -0.76

 4.08 PlyA -  59655  59650    6                              -0.45
 4.07 Term -  60005  59810  196  1  1   81   43   101 0.082   1.78
 4.06 Intr -  90575  90497   79  0  1  101   70    98 0.800   7.91
 4.05 Intr -  93348  93324   25  0  1   63   98     4 0.130  -3.50
 4.04 Intr - 106812 106687  126  0  0  111   96   113 0.875  15.28
 4.03 Intr - 113842 113627  216  1  0   73   79   347 0.768  31.00
 4.02 Intr - 114693 114567  127  2  1   54   99    67 0.958   5.08
 4.01 Init - 115282 115230   53  1  2   40   94   110 0.955   7.44
 4.00 Prom - 124725 124686   40                              -2.66

 5.10 PlyA - 129996 129991    6                               1.05
 5.09 Term - 131494 131330  165  1  0   97   49   151 0.983  10.02
 5.08 Intr - 133553 133455   99  2  0   44   99   136 0.999  10.61
 5.07 Intr - 133796 133655  142  1  1   49   81   182 0.789  13.96
 5.06 Intr - 135041 134919  123  1  0   65   78   124 0.988   8.80
 5.05 Intr - 137277 137078  200  0  2  103  110   199 0.999  21.75
 5.04 Intr - 142314 142196  119  1  2   27   65    62 0.663  -2.02
 5.03 Intr - 143414 143297  118  2  1  114   92   119 0.991  14.94
 5.02 Intr - 144192 144048  145  2  1   45   48   186 0.527  10.58
 5.01 Init - 144938 144871   68  2  2   63   94   113 0.999   8.01
 5.00 Prom - 148158 148119   40                              -5.26

 6.00 Prom + 154909 154948   40                              -5.06
 6.01 Init + 155192 155268   77  1  2   60   96    58 0.745   4.39
 6.02 Intr + 172326 172493  168  0  0   87   78    47 0.059   2.66
 6.03 Intr + 181106 181171   66  2  0  116   91     1 0.093   1.22
 6.04 Intr + 181874 181980  107  2  2   69   70    83 0.844   4.46
 6.05 Intr + 182492 182544   53  0  2  101   99    12 0.948   2.23
 6.06 Term + 182809 182967  159  0  0   61   45   145 0.702   5.44
 6.07 PlyA + 183367 183372    6                               1.05

 7.04 PlyA - 183436 183431    6                               1.05
 7.03 Term - 188277 188153  125  1  2   86   48    82 0.116   2.55
 7.02 Intr - 191575 191385  191  0  2   54   76    29 0.029  -2.47
 7.01 Init - 197736 197654   83  0  2   64   47    90 0.020   2.94
 7.00 Prom - 206959 206920   40                              -0.06

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597r:205878854_206094135|GENSCAN_predicted_peptide_1|146_aa MGAVHEALWQYSPERNSININKKDIHTKTPSVGQQHQKPKVDKTTKMRGNRNREAENSKN QNTSSPPKDYNSLPAREQNWMENEFDELIEVGFRKWVITNFSELKEHVPTHHKEAKNLEK RLEEWLTRITSVEKSINDLMELKNTA >gi568815597r:205878854_206094135|GENSCAN_predicted_CDS_1|441_bp atgggggctgtccatgaagccttgtggcagtacagccctgaaaggaatagcatcaacatc aacaaaaaggacatccacaccaaaaccccatccgtaggtcagcaacatcaaaaaccaaag gtagataaaaccacaaagatgaggggaaaccggaacagagaggctgaaaattccaaaaac cagaacacctcttctcctccaaaggattacaactccttgccagcaagggaacaaaactgg atggagaatgagtttgatgagttgatagaagtaggcttcagaaagtgggtaataacaaac ttctctgagctaaaggagcatgttccaacccatcacaaggaagctaaaaaccttgaaaaa aggttagaggaatggctaactagaataaccagtgtagagaagagcataaatgacctgatg gaactgaaaaacacagcatga >gi568815597r:205878854_206094135|GENSCAN_predicted_peptide_2|312_aa MLARLIKKKREKTQIDAIKNDKGDITTNPTEIQTTIREYYKHLYANKLENLEEMDKFLDT YTLPRLNQEEVESLNRPITGSEIEAIISLPTKKSPGPDGFTAEFYQRYKEELVPFLLKLF QSIEKEGILPNSFYEASIILIPKPGRDTTKKENFGSISLMNINAKILNKILANRIQQHFK KLIHHDQVSFIPGMQDWFNIPKSINVIHHINRTNDKNHMIISIDAGKAFDKIQQPCMLKT LNKLVIFTSFTGELVSGTPHIVMPLLQCALLWGPVATTPFQNSDVGMTTSRETKDVGNSK IKVLADLMLVRH >gi568815597r:205878854_206094135|GENSCAN_predicted_CDS_2|939_bp atgctagcaagactaataaagaagaaaagagagaagactcaaatagatgcaataaaaaat gataaaggggatatcaccaccaatcccacagaaatacaaactaccatcagagaatactat aaacatctctatgcaaataaattagaaaatctagaagaaatggataaattcctggacaca tacaccctcccaagactaaaccaggaagaagtcgaatctctgaatagaccaataacaggt tctgaaattgaggcaataattagcctaccaaccaaaaaaagtccaggaccagatggattc acagctgaattctaccagaggtacaaagaggagctggtaccattccttctgaaattattc caatcaatagaaaaagagggaatcctccctaactcgttttatgaggccagcatcatcctg ataccaaagcctggcagagacacaacaaaaaaagagaattttgggtcaatatccctgatg aacatcaatgcaaaaatcctcaataaaatactggcaaaccgaatccagcagcacttcaaa aagcttatccaccatgatcaagtcagcttcatccctgggatgcaagactggttcaacata cccaaatcaataaacgtaatccatcacataaacagaaccaatgacaaaaaccacatgatt 
atctcaatagatgcaggaaaggcctttgacaaaattcaacagccctgcatgctaaaaact ctcaataaactagtaattttcaccagtttcacaggggagctggtcagtggaactcctcac attgtcatgccacttctgcagtgtgccctcctctgggggccagtagccactacacctttc cagaactcagatgttgggatgacaacatcaagggaaaccaaggatgttgggaattccaag atcaaggtgctggcagatttgatgttggtgcgtcattga >gi568815597r:205878854_206094135|GENSCAN_predicted_peptide_3|882_aa MVTPQVWSASHVLSAKQNSQDMSQPRPRYVVDRAAYSLTLFDDEFEKKDRTYPVGEKLRN AFRCSSAKIKAVVFGLLPVLSWLPKYKIKDYIIPDLLGGLSGGSIQVPQGMAFALLANLP AVNGLYSSFFPLLTYFFLGGVHQMVPGTFAVISILVGNICLQLAPESKFQVFNNATNESY VDTAAMEAERLHVSATLACLTAIIQMGLGFMQFGFVAIYLSESFIRGFMTAAGLQILISV LKYIFGLTIPSYTGPGSIVFTFIDICKNLPHTNIASLIFALISGAFLVLVKELNARYMHK IRFPIPTEMIVVVVATAISGGCKMPKKYHMQIVGEIQRGFPTPVSPVVSQWKDMIGTAFS LAIVSYVINLAMGRTLANKHGYDVDSNQEMIALGCSNFFGSFFKIHVICCALSVTLAVDG AGGKSQSVLGALIAVNLKNSLKQLTDPYYLWRKSKLDCCIWVVSFLSSFFLSLPYGVAVG VAFSVLVVVFQTQFRNGYALAQVMDTDIYVNPKTYNRAQDIQGIKIITYCSPLYFANSEI FRQKVIAKTGMDPQKVLLAKQKYLKKQEKRRMRPTQQRRSLFMKTKTVSLQELQQDFENA PPTDPNNNQTPANGTSVSYITFSPDSSSPAQSEPPASAEAPGEPSDMLASVPPFVTFHTL ILDMSGVSFVDLMGIKALAKLSSTYGKIGVKVFLVNIHAQVYNDISHGGVFEDGSLECKH VFPSIHDAVLFAQANARDVTPGHNFQGAPGDAELSLYDSEEDIRSYWDLEQEMFGSMFHA ETLTALESLSAAGGCYPYRSESLVSPLFTRQALAAMDKPPAHSTPPTSALSLAAEGHLDF QLLRVSQKQKDKYNCAGLLYKLQKVSQSPHGSVSDGVRLSRT >gi568815597r:205878854_206094135|GENSCAN_predicted_CDS_3|2649_bp atggtgacgccacaggtgtggagtgccagccacgtgctgagcgccaagcaaaacagccag gatatgagccagcccaggccccgctacgtggtagacagagccgcatactcccttaccctc ttcgacgatgagtttgagaagaaggaccggacatacccagtgggagagaaacttcgcaat gccttcagatgttcctcagccaagatcaaagctgtggtgtttgggctgctgcctgtgctc tcctggctccccaagtacaagattaaagactacatcattcctgacctgctcggtggactc agcgggggatccatccaggtcccacaaggcatggcatttgctctgctggccaaccttcct gcagtcaatggcctctactcctccttcttccccctcctgacctacttcttcctggggggt gttcaccagatggtgccaggtacctttgccgttatcagcatcctggtgggtaacatctgt ctgcagctggccccagagtcgaaattccaggtcttcaacaatgccaccaatgagagctat gtggacacagcagccatggaggctgagaggctgcacgtgtcagctacgctagcctgcctc accgccatcatccagatgggtctgggcttcatgcagtttggctttgtggccatctacctc 
tccgagtccttcatccggggcttcatgacggccgccggcctgcagatcctgatttcggtg ctcaagtacatcttcggactgaccatcccctcctacacaggcccagggtccatcgtcttt accttcattgacatttgcaaaaacctcccccacaccaacatcgcctcgctcatcttcgct ctcatcagcggtgccttcctggtgctggtgaaggagctcaatgctcgctacatgcacaag attcgcttccccatccctacagagatgattgtggtggtggtggcaacagctatctccggg ggctgtaagatgcccaaaaagtatcacatgcagatcgtgggagaaatccaacgcgggttc cccaccccggtgtcgcctgtggtctcacagtggaaggacatgataggcacagccttctcc ctagccatcgtgagctacgtcatcaacctggctatgggccggaccctggccaacaagcac ggctacgacgtggattcgaaccaggagatgatcgctctcggctgcagcaacttctttggc tccttctttaaaattcatgtcatttgctgtgcgctttctgtcactctggctgtggatgga gctggaggaaaatcccagtctgtgctaggagccctgatcgctgtcaatctcaagaactcc ctcaagcaactcaccgacccctactacctgtggaggaagagcaagctggactgttgcatc tgggtagtgagcttcctctcctccttcttcctcagcctgccctatggtgtggcagtgggt gtcgccttctccgtcctggtcgtggtcttccagactcagtttcgaaatggctatgcactg gcccaggtcatggacactgacatttatgtgaatcccaagacctataatagggcccaggat atccaggggattaaaatcatcacgtactgctcccctctctactttgccaactcagagatc ttcaggcaaaaggtcatcgccaagacaggcatggacccccagaaagtattactagccaag caaaaatacctcaagaagcaggagaagcggagaatgaggcccacacaacagaggaggtct ctattcatgaaaaccaagactgtctccctgcaggagctgcagcaggactttgagaatgcg ccccccaccgaccccaacaacaaccagaccccggctaacggcaccagcgtgtcctatatc accttcagccctgacagctcctcacctgcccagagtgagccaccagcctccgctgaggcc cccggcgagcccagtgacatgctggccagcgtcccacccttcgtcaccttccacaccctc atcctggacatgagtggagtcagcttcgtggacttgatgggcatcaaggccctggccaag ctgagctccacctatgggaagatcggcgtgaaggtcttcttggtgaacatccatgcccag gtgtacaatgacattagccatggaggcgtctttgaggatgggagtctagaatgcaagcac gtctttcccagcatacatgacgcagtcctctttgcccaggcaaatgctagagacgtgacc ccaggacacaacttccaaggggctccaggggatgctgagctctccttgtacgactcagag gaggacattcgcagctactgggacttagagcaggagatgttcgggagcatgtttcacgca gagaccctgaccgccctagagagcctctcagcagcaggggggtgctacccttacaggagt gagagtctggtgagcccactcttcacccgtcaggccctggccgcaatggacaagcctcct gctcactccaccccacccacctctgccctgtccttggcagctgaaggacaccttgacttc cagcttttacgagtgagccaaaaacagaaggacaagtacaactgtgctggcctgctgtac 
aagcttcaaaaagtgtcccagagcccacacggctcggtgtcagatggtgtcaggctgtca cggacatag >gi568815597r:205878854_206094135|GENSCAN_predicted_peptide_4|273_aa MNPRKKVDLKLIIVGAIGVGKTSLLHQYVHKTFYEEYQTTLGASILSKIIILGDTTLKLQ IWDTGGQERFRSMVSTFYKGSDGCILAFDVTDLESFEALDIWRGDVLAKIVPMEQSYPMV LLGNKIDLADRKVPQEVAQGWCREKDIPYFEVSAKNDINVVQAFEMLASRALSRHPAKRL VHTPGTFWAYPLLLMKAAARWKERPPVGRKMAKANTSVQAGGVHMEILGEREILQGFKVV RKGFIGKKEKGKEWERKERVGGGTSTGLKDGQI >gi568815597r:205878854_206094135|GENSCAN_predicted_CDS_4|822_bp atgaatccccggaagaaggtggacctgaaactcattatcgtcggagccattggtgtggga aagacctccctccttcaccaatatgtgcacaagacgttttatgaggaataccagaccaca ctgggggccagcatcctctccaagattatcatattgggtgacacaactttgaagttacag atctgggacacgggcggtcaggagcggttccgctccatggtgtccacgttctacaagggc tccgatggctgcatcctagcttttgatgtcaccgacctggagtcttttgaagccctggat atctggcggggtgatgtcctggccaagattgtccccatggagcagtcctaccccatggtg ttgttggggaacaagatcgatctggcagaccggaaggtaccccaggaagtagctcaaggc tggtgtagagagaaagatattccttactttgaagtcagtgccaagaatgacatcaatgtg gtgcaagcgtttgagatgctggccagtagggctctgtcgaggcacccagcaaagagattg gtgcacacacctggcacgttctgggcctacccgctgctcctcatgaaggcagctgctcgt tggaaggaaaggccaccagttggcagaaaaatggccaaagccaacactagtgtacaggct ggaggggtgcacatggagatcctgggagaaagagaaatccttcagggtttcaaggtggtc aggaaaggcttcataggcaagaaggagaagggaaaagagtgggagaggaaggagagggtt ggaggaggaacttcaacaggtcttaaggatgggcagatctag >gi568815597r:205878854_206094135|GENSCAN_predicted_peptide_5|392_aa MKTLLLLLLVLLELGEAQGSLHRRHPSLKKKLRARSQLSEFWKSHNLDMIQFTESCSMDQ SAKEPLINYLDMEYFGTISIGSPPQNFTVIFDTGSSNLWVPSVYCTSPACKTHSRFQPSQ SSTYSQPGQSFSIQYGTGSLSGIIGADQVSVEGLTVVGQQFGESVTEPGQTFVDAEFDGI LGLGYPSLAVGGVTPVFDNMMAQNLVDLPMFSVYMSSNPEGGAGSELIFGGYDHSHFSGS LNWVPVTKQAYWQIALDNIQVGGTVMFCSEGCQAIVDTGTSLITGPSDKIKQLQNAIGAA PVDGEYAVECANLNVMPDVTFTINGVPYTLSPTAYTLLDFVDGMQFCSSGFQGLDIHPPA GPLWILGDVFIRQFYSVFDRGNNRVGLAPAVP >gi568815597r:205878854_206094135|GENSCAN_predicted_CDS_5|1179_bp atgaaaacgctccttcttttgctgctggtgctcctggagctgggagaggcccaaggatcc cttcacaggaggcatccgtccctcaagaagaagctgcgggcacggagccagctctctgag 
ttctggaaatcccataatttggacatgatccagttcaccgagtcctgctcaatggaccag agtgccaaggaacccctcatcaactacttggatatggaatacttcggcactatctccatt ggctccccaccacagaacttcactgtcatcttcgacactggctcctccaacctctgggtc ccctctgtgtactgcactagcccagcctgcaagacgcacagcaggttccagccttcccag tccagcacatacagccagccaggtcaatctttctccattcagtatggaaccgggagcttg tccgggatcattggagccgaccaagtctctgtggaaggactaaccgtggttggccagcag tttggagaaagtgtcacagagccaggccagacctttgtggatgcagagtttgatggaatt ctgggcctgggatacccctccttggctgtgggaggagtgactccagtatttgacaacatg atggctcagaacctggtggacttgccgatgttttctgtctacatgagcagtaacccagaa ggtggtgcggggagcgagctgatttttggaggctacgaccactcccatttctctgggagc ctgaattgggtcccagtcaccaagcaagcttactggcagattgcactggataacatccag gtgggaggcactgttatgttctgctccgagggctgccaggccattgtggacacagggact tccctcatcactggcccttccgacaagattaagcagctgcaaaacgccattggggcagcc cccgtggatggagaatatgctgtggagtgtgccaaccttaacgtcatgccggatgtcacc ttcaccattaacggagtcccctataccctcagcccaactgcctacaccctactggacttc gtggatggaatgcagttctgcagcagtggctttcaaggacttgacatccaccctccagct gggcccctctggatcctgggggatgtcttcattcgacagttttactcagtctttgaccgt gggaataaccgtgtgggactggccccagcagtcccctaa >gi568815597r:205878854_206094135|GENSCAN_predicted_peptide_6|209_aa MKLAKSTQREFVEEGEAPGLCSEDCRHVSSFSEIDITQFNITYFTGVLWALSNQFPLRRL KICHQLRGSKVCGVGSPKFWQRYLALHILVISFSGPPVISANPCLSTAATALSGSIAVVS LILLLVGLLSMTLKKWRQERLFKKQLRHQTNFPHKSSDLSCHADAIYSNVINLAPQKEDD FAVYTNMPPFHHPRRTLPDQVEYVSIVFH >gi568815597r:205878854_206094135|GENSCAN_predicted_CDS_6|630_bp atgaaactggccaagagcacccagagagagtttgtagaagaaggagaagccccaggactg tgctctgaagactgcaggcacgtgtcctctttttctgaaattgacattacccagttcaac atcacctactttaccggagtactttgggctctttcaaatcaatttccgcttcgaaggcta aagatttgtcatcagctaagaggttcaaaagtctgtggtgtaggctctccaaaattctgg cagaggtacctagcactgcacatcctggttatctctttttctggacctcctgtcatatct gctaacccttgcctaagcacagcagccacagccctttctggctccattgctgtggtgtcc ctcatcttgctcctggtgggtctcttgtccatgaccctgaagaaatggaggcaagagaga ctatttaagaaacaactgaggcatcagaccaactttccccacaagtcctcggatctttcc tgccatgctgatgccatatattccaacgtgatcaacctggctccccagaaggaggacgac 
tttgctgtctacaccaacatgcccccttttcatcaccccaggaggacattgccagaccaa gtggaatatgtctccattgtattccactga >gi568815597r:205878854_206094135|GENSCAN_predicted_peptide_7|132_aa MSRTVVKEQENLAVVIQSDFPAVGVSLWRDNNSHIAKRLAVLLPTEQNFLRIFWKSSAQS LILQNALSSSSERSRGYRGMAGEKGSSPCQADFLRDKQKPISEKQRLEDPSMDFSRSPDT VARLHSLITGSP >gi568815597r:205878854_206094135|GENSCAN_predicted_CDS_7|399_bp atgagcagaacagttgtcaaggaacaagaaaatcttgcagttgtcatccagtccgacttc cctgccgtgggcgtgagcctctggagagataacaacagtcacatagccaaacgtctagca gtgcttctgcctaccgaacagaatttcctgagaatcttctggaaaagcagtgcccagtca ctcattctgcaaaatgcactcagcagctcctctgagagaagccgtggatatagggggatg gcaggagagaaaggcagctctccctgccaggcggactttctaagagataaacagaaacca atatctgaaaaacaaagactggaagatccttccatggacttcagcagatcacctgacacc gtggccagactccactcccttatcacaggttcaccatga