GENSCAN 1.0	Date run: 2-Nov-116	Time: 17:59:25

Sequence gi568815593f:65463311_65687393 : 224083 bp : 37.43% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.08 PlyA -   2483   2478    6                               1.05
 1.07 Term -  25309  24422  888  1  0   23   39   438 0.143  23.87
 1.06 Intr -  26270  26182   89  0  2   59  108    47 0.373   2.57
 1.05 Intr -  26825  26670  156  0  0   32   70   108 0.210   2.46
 1.04 Intr -  27812  27658  155  1  2   81   55   100 0.310   4.79
 1.03 Intr -  28553  28462   92  2  2   76   80    50 0.396   0.87
 1.02 Intr -  52834  52767   68  2  2  112   72    65 0.032   5.01
 1.01 Init -  76490  76388  103  2  1   84   74    51 0.137   3.76
 1.00 Prom -  80448  80409   40                              -5.85

 2.03 PlyA -  82721  82716    6                              -0.45
 2.02 Term -  86622  86101  522  0  0   67   39   238 0.950  10.09
 2.01 Init -  91597  91487  111  1  0   26  100   161 0.996  11.36
 2.00 Prom -  97312  97273   40                              -7.35

 3.00 Prom +  97807  97846   40                              -8.05
 3.01 Init + 100001 100196  196  1  1  100   65   306 0.992  28.64
 3.02 Intr + 104203 104294   92  2  2   86   56    57 0.607   1.09
 3.03 Intr + 106540 106688  149  0  2   63   92   102 0.635   6.21
 3.04 Intr + 108529 108976  448  1  1   56   94   316 0.977  21.42
 3.05 Intr + 113569 113759  191  0  2   54  119   190 0.975  16.16
 3.06 Intr + 116114 116303  190  2  1   62   80   140 0.644   9.27
 3.07 Intr + 119728 119909  182  0  2   27   27   265 0.261  12.04
 3.08 Intr + 121704 121785   82  0  1   68  115   -17 0.205  -2.28
 3.09 Intr + 122689 122871  183  0  0   39  127   109 0.200   8.96
 3.10 Term + 123943 124086  144  0  0   55   38   146 0.978   3.23
 3.11 PlyA + 124114 124119    6                               1.05

 4.11 PlyA - 124217 124212    6                               1.05
 4.10 Term - 128638 128459  180  1  0   70   34   201 0.999   9.53
 4.09 Intr - 131335 131211  125  2  2   62   95    54 0.677   2.88
 4.08 Intr - 133870 133741  130  1  1   17  111   131 0.621   7.65
 4.07 Intr - 141735 141601  135  0  0   61   90   152 0.961  12.54
 4.06 Intr - 146148 145933  216  0  0   88   96   198 0.967  18.38
 4.05 Intr - 147733 147551  183  1  0   30   22   215 0.996   8.16
 4.04 Intr - 148571 148293  279  2  0   60   86   126 0.913   6.35
 4.03 Intr - 150909 150788  122  1  2   72   57   121 0.993   6.69
 4.02 Intr - 154945 154783  163  1  1   35   63   154 0.966   6.23
 4.01 Init - 158724 158722    3  0  0  108   81     0 0.552   1.35
 4.00 Prom - 159193 159154   40                              -8.15

 5.02 PlyA - 159342 159337    6                              -0.45
 5.01 Sngl - 160964 160632  333  2  0   88   43   240 0.904  15.49
 5.00 Prom - 163117 163078   40                              -7.15

 6.00 Prom + 164403 164442   40                              -6.65
 6.01 Init + 166278 166629  352  2  1   61   32   168 0.543   5.87
 6.02 Intr + 171991 172059   69  2  0   89   60    52 0.508   0.74
 6.03 Intr + 172634 172733  100  0  1   63   86   101 0.986   5.65
 6.04 Intr + 174386 174470   85  2  1   98  106    33 0.997   5.00
 6.05 Intr + 183745 183872  128  0  2   69   75   131 0.856   8.46
 6.06 Intr + 197389 197587  199  1  1   74  101    95 0.813   7.83
 6.07 Term + 198740 198844  105  1  0   79   38    85 0.539   0.03
 6.08 PlyA + 200202 200207    6                               1.05

 7.06 PlyA - 200820 200815    6                               1.05
 7.05 Term - 207047 206936  112  1  1   87   34   166 0.960   8.15
 7.04 Intr - 208688 208605   84  1  0   79   94    59 0.481   3.62
 7.03 Intr - 208971 208934   38  0  2   71   87    27 0.477  -2.96
 7.02 Intr - 217484 217346  139  1  1  107   95   122 0.943  14.35
 7.01 Intr - 222162 222058  105  2  0  113   98    92 0.998  11.11

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:65463311_65687393|GENSCAN_predicted_peptide_1|516_aa MAGSTWVLKEGASWQSLELMTSVQESHGAQAQRYYLYVEALTPSAIVFGDRILEDIKNQC LAIAHSRFKFLASSRQFPLQLPKARDFRRPLAFYQSNCTGERGMIRLFRDLLDTGSELTF IPGDPKHHCGPPVKVGAYGVKFLVVQWCGASQDIPSKVKDKLLHLPPPTTKKEGQRLVGL FGFWRQYIPHSAIKWGMHSGIPSSNRSGIYVIRLEQVLRAQICVCLSCMECLIHHHGILH SIASDQGTYFTAKEVQQYVHALGIHWSSPYHFEAAGLIERWNGLLKSQLQHQLGDNTLQG WRRVLQKAVYALNQCPIYGTVSPIARIHGSRNQGVEVEVAPLTITPSGPLEKFLLPVPAT LRSAGLKVLGPEWRVLPPGHTTMIPLNWKLILPPGHFELLLPLSQQAKKGVTVLPGVIGL DYQDEISLLLILKSVYSSLPDGGKEEYAWNTGDPLGPLLVLPCPVIKVNGKLQQPNPSRT KNDPYPSGMKVWVTLPGTKPQPAEVLAEETQGIQNG >gi568815593f:65463311_65687393|GENSCAN_predicted_CDS_1|1551_bp atggctggttccacctgggttttaaaggaaggagcctcctggcagagccttgagcttatg acctcagtccaggagagccatggtgcccaggcacagcgatactatttatatgttgaagcc ctaacccccagtgcgatagtgtttggagacaggatcttggaagatattaagaatcagtgt cttgctattgcccactctcgtttcaaattcctggcctcaagccggcaattcccacttcag cttcccaaagcaagagatttcaggagacctctggccttttaccagagtaactgcactggg gaaaggggaatgatcagacttttcagggacttactggatactggctctgagctgacattt attccaggggacccaaaacatcattgtggtcctccagttaaagtaggggcttatggagta aaatttctagtggtccagtggtgtggggcctctcaagatattccttctaaggtgaaggat aagttgctgcatttgccccctcctacaaccaagaaagagggacaacgcctagtgggtcta tttggattttggaggcaatacattcctcattctgccataaagtggggcatgcacagcggc attccatcatcaaatcgaagtggtatatatgtgatcaggcttgagcaagtcctgagggca caaatatgcgtttgcctatcctgcatggaatgccttatccaccatcatggtattctgcat agcattgcctctgaccaaggcacatactttacagctaaagaagtgcagcagtatgttcat gctcttggaattcactggtcttctccttatcattttgaagcagctggattgatagaacgg tggaatggccttttgaagtcacaattacaacatcaactaggtgacaatactttgcagggc tggcgcagagttctccagaaggctgtgtatgctctgaatcagtgtccaatatatggtaca gtttctcccatagccaggattcacgggtccaggaatcaaggggtggaagtggaagtggca ccactcaccatcacccctagtggcccattagaaaaatttttgcttcctgttcccgcaaca ttacgttctgctggcctaaaagtcttaggtccagagtggagagtgctgccaccaggacac 
acaacaatgattccattaaactggaagttaatattaccacctggacactttgagctcctc ctacctttaagtcagcaggctaagaagggagttacagtgttgcctggggtgattggcttg gactatcaagatgaaatcagtctactactcatcttgaaatcagtctactcttccttacct gatggaggtaaggaagagtatgcatggaatacaggagatcctttagggcctctcttagta ttaccatgcccagtgattaaggtcaatgggaaactacaacaacccaatccaagcaggact aaaaatgacccatacccctccggaatgaaggtttgggtcactctaccaggtacaaaacca caacctgctgaggtgcttgctgaagagacacagggaatacagaatgggtag >gi568815593f:65463311_65687393|GENSCAN_predicted_peptide_2|210_aa MNQEDLDPDSTTDVGDVTNTEEELIRECEEMWKDMEESPKVQEKFLKEIKDVALVNTQIM RKQNSLIAVIEKVLLVYLDDQTSHHSLSQSVIHNKALALFNSTKAERDEEAAEEKCEASR GWFMRFKERSHLHNIKMQGEAASTEAEASTSYPEDLAKILEGGYDKQQIFNVDRTALYWK KMPSRTLIVREKLMPGSKASKGQVDSLDKD >gi568815593f:65463311_65687393|GENSCAN_predicted_CDS_2|633_bp atgaatcaggaggatctagatccggatagtactacagatgtgggagatgttacaaatact gaagaagaacttattagagaatgtgaagaaatgtggaaagatatggaagaatctcccaaa gtgcaggaaaaatttttaaaggaaattaaagatgttgctctagtgaacacacaaattatg agaaaacagaatagccttattgctgttatagagaaagttttactagtctacttagatgat caaaccagccatcattccctaagccaaagcgtaatccacaacaaagccctagctcttttc aattctacaaaggctgagagagatgaggaagctgcagaagaaaagtgtgaagctagcaga ggttggttcatgaggtttaaggaaagaagccatctccataacataaaaatgcaaggggaa gcagcaagtactgaggcagaagcgtcaacaagttatccagaagatctagctaagatactt gaaggtggctatgataaacagcaaattttcaatgtagacagaacagccctctattggaag aagatgccatctaggactcttatagttagagagaagttgatgcctggctccaaagcttca aaaggacaggttgactctcttgataaggactag >gi568815593f:65463311_65687393|GENSCAN_predicted_peptide_3|618_aa MAAESGSDFQQRRRRRRDPEEPEKTELSERELAVAVAVSQENDEENEERWVGPLPVEATL AKKRKVLEFERVYLDNLPSASMYERSYMHRDVITHVNVLISFIEIGVIESIAVSSEGALF CSVGDDKAMKVFDVVNFDMINMLKLGYFPGQCEWIYCPGDAISSVAASEKSTGKIFIYDG RGDNQPLHIFDKLHTSPLTQIRLNPVYKAVVSSDKSGMIEYWTGPPHEYKFPKNVNWEYK TDTDLYEFAKCKAYPTSVCFSPDGKKIATIGSDRKVRIFRFVTGKLMRVFDESLSMFTEL QQMRQQLPDMEFGRRMAVERELEKVDAVRLINIVFDETGHFVLYGTMLGIKVINVETNRC VRILGKQENIRVMQLALFQGIAKKHRAATTIEMKASENPVLQNIQADPTIVCTSFKKNRF YMFTKREPEDTKSADSDRDVFNEKPSKEEVMAATQAEGPKRVSDSAIIHTSMGDIHTKLF 
PVECPKTVENFCVHSRNGYYNGHTFHRIIKGFMIQTGDPTGTGMGGESIWGGEFEDEFHS TLRHDRPYTLSMANAGSNTNGSQFFITVVPTPWLDNKHTVFGRVTKGMEVVQRISNVKVN PKTDKPYEDVSIINITVK >gi568815593f:65463311_65687393|GENSCAN_predicted_CDS_3|1857_bp atggcggcggaaagtggtagcgattttcagcagagacgtagaaggcgccgggacccggag gaaccggaaaaaacagaactcagcgaaagagagctggcagtagcagtggcggtgtcccag gagaacgatgaggagaacgaagagcgctgggttggacctttacctgtggaggcaacactg gccaagaagaggaaagtcttagagtttgaaagagtctatcttgataatctccccagtgca tccatgtatgagcgcagttacatgcatagagatgttatcacccatgtgaatgttttaatt tcatttattgaaataggagttattgagagtattgcagttagctctgagggagcattgttc tgttctgtgggtgatgataaagcaatgaaggtgtttgatgtagtgaactttgacatgatc aacatgctgaaacttggctattttcctggacagtgtgagtggatctattgcccaggggat gcaatttcttcagttgctgcttccgaaaagagtacaggaaaaattttcatttatgatggc cgaggagataaccagccacttcatatttttgacaaactccatacatcacctcttactcag atacggctaaacccagtttacaaagcagtagtgtcttctgacaaatctgggatgattgaa tactggactgggcctcctcatgaatataaattccccaaaaatgtgaactgggaatataaa actgacactgatttatatgaatttgccaagtgtaaggcttatccaaccagcgtatgtttt tcaccagatgggaagaaaatagctactattggttctgatagaaaagttagaattttcaga tttgtaactggaaaactcatgagagtctttgatgaatcactaagcatgtttactgaactg caacagatgaggcaacagttaccagacatggaatttggccgacgaatggctgtagaacgt gagttggagaaggttgatgctgtaagattaattaatatagtttttgatgaaactggacac ttcgtgctgtatggaacaatgctgggcattaaagttataaatgtagagacaaaccggtgt gtgcggattttaggcaaacaagaaaatattagagtgatgcaattggctttgttccagggg atagccaaaaagcatcgtgctgcaactactatagaaatgaaagcttctgaaaatcctgtt cttcagaatattcaagctgacccaacaatagtctgtacatcattcaaaaagaatagattt tatatgtttaccaaacgagaaccagaagatacgaaaagtgcagattctgatcgagatgtt tttaatgagaaaccttctaaagaagaagtcatggcagctactcaagctgaaggacctaaa cgagtttcggacagtgccattatccacaccagcatgggagacattcacaccaaacttttt cctgttgagtgccctaagacagtggaaaacttctgtgttcacagcagaaatggttattat aatgggcatacatttcaccgtataattaagggctttatgattcagactggagatccaaca ggtactggtatgggaggagaaagcatatggggaggagaatttgaagatgaatttcattca acattacgacatgacaggccatacacactcagcatggctaacgcgggatcaaatactaat ggatcccagtttttcataacggtagtaccaacgccttggcttgataataagcatacagta 
tttggacgagtgactaaaggaatggaagttgtacagaggatctccaacgtcaaagtcaat cccaaaacagataagccctatgaggatgtcagcatcataaatattactgtcaagtaa >gi568815593f:65463311_65687393|GENSCAN_predicted_peptide_4|511_aa MVLECGVCEDVFSLQGDKVPRLLLCGHTVCHDCLTRLPLHGRAIRCPFDRQVTDLGDSGV WGLKKNFALLELLERLQNGPIGQYGAAEESIGISGESIIRCDEDEAHLASVYCTVCATHL CSECSQVTHSTKTLAKHRRVPLADKPHEKTMCSQHQVHAIEFVCLEEGCQTSPLMCCVCK EYGKHQGHKHSVLEPEANQIRASILDMAHCIRTFTEEISDYSRKLVGIVQHIEGGEQIVE DGIGMAHTEHVPGTAENARSCIRAYFYDLHETLCRQEEMALSVVDAHVREKLIWLRQQQE DMTILLSEVSAACLHCEKTLQQDDCRVVLAKQEITRLLETLQKQQQQFTEVADHIQLDAS IPVTFTKDNRVHIGPKMEIRVVTLGLDGAGKTTILFKLKQDEFMQPIPTIAVVFVVDSSH RDRISEAHSELAKLLTEKELRDALLLIFANKQDVAGALSVEEITELLSLHKLCCGRSWYI QGCDARSGMGLYEGLDWLSRQLVAAGVLDVA >gi568815593f:65463311_65687393|GENSCAN_predicted_CDS_4|1536_bp atggtgctagagtgtggagtttgtgaagatgtcttttctttgcaaggagacaaagttccc cgtcttttgctttgtggccataccgtctgtcatgactgtctcactcgcctacctcttcat ggaagagcaatccgttgcccatttgatcgacaagtaacagacctaggtgattcaggtgtc tggggattgaaaaaaaattttgctttattggagcttttggaacgactgcagaatgggcct attggtcagtatggagctgcagaagaatccattgggatatctggagagagcatcattcgt tgtgatgaagatgaagctcaccttgcctctgtatattgcactgtgtgtgcaactcatttg tgctctgagtgttctcaagttactcattctacaaagacattagcaaagcacaggcgagtt cctctagctgataaacctcatgagaaaactatgtgctctcagcaccaggtgcatgccatt gagtttgtttgcttggaagaaggttgtcaaactagcccactcatgtgctgtgtctgcaaa gaatatggaaaacaccagggtcacaagcattcagtattggaaccagaagctaatcagatc cgagcatcaattttagatatggctcactgcatacggaccttcacagaggaaatctcagat tattccagaaaattagttggaattgtgcagcacattgaaggaggagaacaaatcgtggaa gatggaattggaatggctcacacagaacatgtaccagggactgcagagaatgcccggtca tgtattcgagcttatttttatgatctacatgaaactctgtgtcgtcaagaagaaatggct ctaagtgttgttgatgctcatgttcgtgaaaaattgatttggctcaggcagcaacaagaa gatatgactattttgttgtcagaggtttctgcagcctgcctccactgtgaaaagactttg cagcaggatgattgtagagttgtcttggcaaaacaggaaattacaaggttactggaaaca ttgcagaaacagcagcagcagtttacagaagttgcagatcacattcagttggatgccagc atccctgtcacttttacaaaggataatcgagttcacattggaccaaaaatggaaattcgg gtcgttacgttaggattggatggtgctggaaaaactactatcttgtttaagttaaaacag 
gatgaattcatgcagcccattccaacaattgctgttgtgtttgttgtagatagcagtcat agagacagaattagtgaagcacacagcgaacttgcaaagttgttaacggaaaaagaactc cgagatgctctgctcctgatttttgctaacaaacaggatgttgctggagcactgtcagta gaagaaatcactgaactactcagtctccataaattatgctgtggccgtagctggtatatt cagggctgtgatgctcgaagtggtatgggactgtatgaagggttggactggctctcacgg caacttgtagctgctggagtattggatgttgcttga >gi568815593f:65463311_65687393|GENSCAN_predicted_peptide_5|110_aa MATLVVNKLGAGVDSGRQGSRGTAVVKVRDLACSILHHRPPFGSSWRPGGIAAAQASIGG MRGLWAPFGITSLVLSSASFLSYGLKLPGLLLMTCHHRDICEKELAGQWQ >gi568815593f:65463311_65687393|GENSCAN_predicted_CDS_5|333_bp atggctaccctggttgtaaacaagctcggagcgggagtagacagtggccggcagggcagc cgggggacagctgtagtgaaggtgagggaccttgcgtgctctattctccaccatcggcct cccttcggttcatcctggagacctggcggcatcgccgcggcccaggcttcgataggtggg atgcgcggcctctgggccccttttggcattacaagtctcgtcctgtcctctgcatccttt ctctcctatggcctgaaattgcctggtcttctgctcatgacatgtcatcatcgggacatt tgtgaaaaggagcttgcgggccagtggcagtga >gi568815593f:65463311_65687393|GENSCAN_predicted_peptide_6|345_aa MTTEVILHYRPCESDPTQLPKIAEKAIQDFPTRPLSRFIPWFPYDGSKLPLRPKRSPPVI SEEAAEDVKQYLTISEHDAKSHSYDCTVDLLEFQPSLKKQHLTWSHTLKEQTNSGNLVMR LTKPTLFTNIPVTCEEKDLPGDLFNQLMRDDPSTVNGAEVLMLGEMLTLPQNFGNIFLGE TFSSYISVHNDSNQVVKDILVKADLQTSSQRLNLSASNAAVAELKPDCCIDDVIHHEVKE IGTHIVSTFGSRAYLQPMDTRQYLYCLKPKNEFAEKAGIIKGVTVIGKLDIVWKTNLGER GRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCR >gi568815593f:65463311_65687393|GENSCAN_predicted_CDS_6|1038_bp atgactacagaagtaatattacattatcgaccatgtgagagtgatcccacacaactgcca aaaattgcagaaaaagcaattcaagactttcctactcgtccgctatcaagatttatacct tggtttccatatgatgggtccaagcttccactcagacctaaaagatcaccacctgtgatt tctgaagaggcagctgaagatgtgaaacagtacttaaccatttcagaacatgatgctaag tcacacagttatgattgcacagtagatctattggagtttcaacctagcttgaaaaagcag catttaacctggtcacacacactgaaggaacagactaattctggaaatctggtgatgcgg ctgactaagcctactttattcaccaatatcccagtaacatgtgaagagaaagacttacct ggagatctctttaaccagctgatgagagatgatccttcaaccgttaatggtgcagaagtt ttaatgttgggagaaatgctgactttaccacagaattttgggaatatatttttgggagag 
accttttccagttatatcagcgttcataatgatagcaatcaagttgtaaaagacatatta gtaaaagctgatcttcagacaagttctcagcgtttaaatctttcagcctccaatgctgca gtggctgaacttaaaccggattgttgtattgatgatgtcatacatcatgaagtcaaagaa attggaacacacattgtgtctacgtttgggtcaagagcatatttgcaaccaatggataca cgccagtacttatactgcctaaagccaaagaatgaatttgcagaaaaagcaggcatcatt aagggagtaacagtaattggaaaattggatatagtatggaaaacaaatctaggtgaaagg ggaaggttacagaccagccaacttcaaagaatggctccaggttatggagatgttaggttg tctttggaggcaataccagataccgtaaaccttgaagaaccttttcatattacctgtaaa ataacaaactgcaggtaa >gi568815593f:65463311_65687393|GENSCAN_predicted_peptide_7|159_aa XAAAQSKLGHYTDAIKDCEKAIAIDSKYSKAYGRMGLALTALNKFEEAVTSYQKALDLDP ENDSYKSNLKIAEQKLREVSSPAASLMQNPQVQQLMSGMMTNAIGGPAAGVGGLTDLSSL IQAGQQFAQQIQQQNPELIEQLRNHIRSRSFSSSAEEHS >gi568815593f:65463311_65687393|GENSCAN_predicted_CDS_7|480_bp nnggctgctgctcagagcaaattaggtcactacacagatgcgataaaggattgtgaaaaa gcaatagcaattgattcaaagtacagcaaggcctatgggagaatggggctggccctcact gccttgaataaatttgaagaagcagttacaagttatcaaaaggcattagatcttgaccct gaaaatgattcctataagtcaaatctgaaaatagcagaacagaagttaagagaggtatcc agtcctgcggcaagtttaatgcagaaccctcaagttcaacagctaatgtcaggaatgatg acaaatgccattgggggacctgctgctggagttgggggcctaactgacctgtcaagcctc atccaagcgggacagcagtttgctcagcagatacagcaacaaaatcctgaacttatagag caactgagaaatcacatccggagcagatcattcagcagcagcgctgaagagcattcctga