GENSCAN 1.0	Date run:  8-Nov-116	Time: 14:02:09

Sequence gi568815587f:73705025_73964367 : 259343 bp : 43.56% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.12 Intr -   2489   2396   94  1  1   73  111    53 0.022   6.07
 1.11 Intr -   6245   6135  111  1  0   25   34   139 0.018   1.69
 1.10 Intr -  11338  11227  112  2  1   89   64    70 0.925   4.14
 1.09 Intr -  13866  13761  106  0  1   90   58    99 0.962   6.89
 1.08 Intr -  15875  15822   54  2  0  126   79    44 0.907   6.48
 1.07 Intr -  36497  36430   68  0  2  101   95    33 0.051   3.92
 1.06 Intr -  55161  55024  138  1  0   95   81    42 0.625   4.64
 1.05 Intr -  55618  55542   77  0  2   34  101    90 0.633   4.06
 1.04 Intr -  55812  55661  152  0  2   53   18   124 0.566   0.86
 1.03 Intr -  56441  56309  133  1  1  152   92    50 0.731  12.45
 1.02 Intr -  92304  92059  246  2  0   67   81   135 0.697   7.27
 1.01 Init -  92620  92547   74  1  2   87   55   103 0.672   5.50
 1.00 Prom -  97035  96996   40                              -5.06

 2.00 Prom + 102521 102560   40                              -4.46
 2.01 Init + 103476 103479    4  2  1   89   72     0 0.080  -1.34
 2.02 Intr + 120684 120772   89  1  2  105   82    76 0.661   8.39
 2.03 Intr + 139783 139952  170  0  2  116   86   109 0.991  12.24
 2.04 Intr + 154883 154985  103  2  1   99   96    69 0.981   8.98
 2.05 Intr + 158148 158237   90  2  0   91   92    24 0.750   3.29
 2.06 Term + 159272 159346   75  1  0  110   37   121 0.998   7.04
 2.07 PlyA + 159557 159562    6                               1.05

 3.03 PlyA - 160078 160073    6                               1.05
 3.02 Term - 168355 168091  265  0  1   68   53   231 0.111  12.58
 3.01 Init - 182051 181960   92  2  2   76   81    59 0.640   3.97
 3.00 Prom - 182515 182476   40                              -4.86

 4.00 Prom + 185000 185039   40                              -5.36
 4.01 Init + 195324 195396   73  2  1   68   64    74 0.456   4.23
 4.02 Intr + 204375 204569  195  1  0   66  100   117 0.953  10.09
 4.03 Intr + 219591 219673   83  1  2  123   87    36 0.762   6.36
 4.04 Intr + 226155 226301  147  2  0   59   47   102 0.312   3.63
 4.05 Term + 231033 231068   36  2  0  109   49    73 0.741   2.84
 4.06 PlyA + 234612 234617    6                               1.05

 5.02 PlyA - 235851 235846    6                               1.05
 5.01 Sngl - 241530 241186  345  0  0   82   42   220 0.366  12.84
 5.00 Prom - 242247 242208   40                              -5.66

 6.00 Prom + 245221 245260   40                              -8.86
 6.01 Init + 246046 246113   68  0  2   96   88    58 0.955   7.14
 6.02 Intr + 252380 252544  165  2  0   76   55    62 0.529   0.78
 6.03 Intr + 253293 253396  104  0  2  113   84   120 0.937  13.92
 6.04 Intr + 254470 254631  162  2  0   30   83   229 0.983  16.55
 6.05 Term + 256484 256704  221  0  2   71   43   133 0.989   4.30
 6.06 PlyA + 258164 258169    6                              -0.45

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term -   6245   6092  154  1  1   25   49   162 0.808   3.39
S.002 Init +  74471  74614  144  1  0  101   77    91 0.860   7.68
S.003 Term -  91373  91139  235  1  1   40   36   120 0.833  -2.21

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587f:73705025_73964367|GENSCAN_predicted_peptide_1|455_aa
MPAAAAGRAAAGAGTGTGSLRGRTRSQVLVLQPEQMEYADKRRVSKTKKSFIEWQNSSEE
DLGVGSSSLQAGRLNVFPALSREVSRVDSSSLQPVVPMSVQLWLSPRLRGATLLMAATVL
RFPQLEGCARSHAATSLACGLALSPFRLAHYATLPGGGGGCQSVESPAALQPGSSTGPCR
GRESSVPALPLAFFVSWLEQHRSTMSTGGDFGNPLRKFKLVFLGEQSVPQLCEAHSGKPH
SRERALFNLGQGGWDLGVVFPRGLALRSADVVEDFCVDMFSALLGKYQGAWSLDHMATIG
IDFLSKTMYLEDRTIRLQLWDTAGQERFRSLIPSYIRDSAAAVVVYDITNVNSFQQTTKW
IDDVRTERGSDVIIMLVGNKTDLADKSWTAAVELSIKYFQEKLQQDLEAEHGRGRGYSPH
TTVLQVSIEEGERKAKELNVMFIETSAKAGYNVKQ

>gi568815587f:73705025_73964367|GENSCAN_predicted_CDS_1|1365_bp
atgccggctgctgcagcagggcgggcagctgcaggtgctggcacaggcactggctctctg
cgaggccggaccaggtcccaagttcttgtcctgcaaccagaacaaatggagtatgcagac
aagcggagggtgagcaagacaaagaagagctttattgagtggcagaatagctcagaggag
gaccttggagtgggcagctcctctctgcaggcaggtcgtctgaatgtcttcccagctctc
agcagagaagtctctagagtggatagctcctctctgcagccggttgtcccgatgtctgtt
cagctctggctgagcccaaggctgagaggcgccacacttcttatggccgccacagtgctc
cgctttccacagctcgagggctgcgcgcgctctcacgctgccacctcactcgcgtgtggg
ctcgccctcagccccttccgattggctcattatgctactctgccgggaggcggcggcggc
tgccagtctgtggagagtcctgctgccctccagccgggctcctccaccgggccttgcagg
ggccgagagagctcggtgcccgcccttccgctcgcctttttcgtcagctggctggagcag
catcgttccacaatgtccacgggcggagacttcgggaatccgctgaggaaattcaagctg
gtgttcctgggggagcaaagcgttcctcagctctgtgaggcccactcagggaagccccac
tcccgagagcgggccctcttcaaccttggccaaggagggtgggatctgggcgttgttttc
cccagaggcttggccctgcgaagtgctgatgtcgtggaagatttttgtgtggacatgttt
tcagctcttttgggtaaataccaaggagcatggtcgctggatcatatggcaacaattggc
attgactttttatcaaaaactatgtacttggaggatcgaacaatcaggcttcagctgtgg
gatactgcgggtcaggaacgtttccgtagcctcattcccagttacatccgtgattctgct
gcagctgtagtagtttacgatatcacaaatgttaactcattccagcaaactacaaagtgg
attgatgatgtcagaacagaaagaggaagtgatgttatcatcatgctagtaggaaataaa
acagatcttgctgacaagagctggactgctgctgtggagctcagcatcaagtacttccag
gagaagcttcagcaggacctagaggcagagcatggtagaggtagaggatacagccctcat
accaccgtgctgcaagtgtcaattgaggagggagagaggaaagccaaagagctgaatgtt
atgtttattgaaactagtgcaaaagctggatacaatgtaaagcag

>gi568815587f:73705025_73964367|GENSCAN_predicted_peptide_2|176_aa
MGGILLSISRPYKTKPTHGIGKYKHLIKAEEPKKKKGKVEVRAINLGTDYEYGVLNIHLT
AYDMTLAESYAQYVHNLCNSLSIKVEESYAMPTKTIEVLQLQDQGSKMLLDSVLTTHERV
VQISGLSATFAEIFLEIIQSSLPEGVRLSVKEHTEEDFKGRFKARPELEELLAKLK

>gi568815587f:73705025_73964367|GENSCAN_predicted_CDS_2|531_bp
atgggtggaattctactaagtatcagtcggccctacaagacaaagcccacccacggcatt
ggaaagtacaagcacttaattaaagcagaagagcccaagaagaagaagggaaaagtggaa
gtgagagccattaatttggggacagattatgaatatggggttttaaatattcatctgact
gcatatgatatgaccctggcagagagttatgcccagtatgttcacaacctctgcaactct
ctctccattaaagtcgaggaaagttatgcaatgccaaccaaaaccatagaagtgttgcag
ttgcaggaccaaggcagcaaaatgctcctggactcagtgcttaccacccatgagcgagtg
gttcagatcagcggtttgagtgctacgtttgcagaaattttcttggaaataatccaaagc
agtcttcctgaaggagtcagactgtcagtgaaggagcacactgaagaagacttcaaggga
cgattcaaagctcgaccagaactggaagaactgttggccaagttgaagtag

>gi568815587f:73705025_73964367|GENSCAN_predicted_peptide_3|118_aa
MAPASASDESLRLLPLMAEGEGEPVYAEITWMSTSVPQGHTWTQRVKKDDEEEDPLDQLI
SRSGCAASHFAVQECMAQHQDWRQCQPQVQAFKDCMSEQQARRQEELQRRQEQAGAHH

>gi568815587f:73705025_73964367|GENSCAN_predicted_CDS_3|357_bp
atggcaccagcatctgcttctgatgagagcctcaggctgcttccactcatggcagaagga
gaaggggagccagtgtatgcagagatcacatggatgtcaacctcagtccctcaaggccat
acctggacccaacgggtgaagaaagacgatgaggaggaggacccgctggaccagctgatc
tcccgctctggctgtgctgcctcccactttgcagtgcaggagtgcatggcccagcaccag
gactggcggcaatgccagccacaggtgcaggcgttcaaggattgcatgagtgaacagcag
gcgaggcggcaagaggagctgcagaggaggcaagaacaagccggtgcccaccactga

>gi568815587f:73705025_73964367|GENSCAN_predicted_peptide_4|177_aa
MDAQLKIWSAEDASCVVTFKGHKGGILDTAIVDRGRNVVSASRDGTARLWDCGRSACLGV
LADCGSSINGVAVGAADNSINLGSPEQMPSDGSCFIVQQDLDYVTELTGADCDPVYKGSL
FYIATTFVRSFSYVSRVDVLVVIYAVTLDSCSEIQTQTEGKETSSQKTAIVIMAHSQ

>gi568815587f:73705025_73964367|GENSCAN_predicted_CDS_4|534_bp
atggatgcccagctgaagatatggtcagctgaagatgctagctgcgtggtgaccttcaaa
ggtcacaaaggaggtatcctggatacagccatcgttgatcgggggaggaatgtggtgtct
gcttctcgagatgggacagcacgactttgggattgtgggcgctcagcctgcttgggagtc
cttgcagattgtggttcttctatcaatggagtggcggtgggtgctgctgacaactccata
aaccttggctcccctgagcagatgcccagtgatggaagctgttttattgtccagcaagac
ttagactatgtcactgagctcactggggctgactgtgaccctgtgtacaagggctcactc
ttctacattgccaccacattcgtccgctccttttcatacgtctccagagtcgatgtcctg
gttgtgatatatgcagtgacccttgacagctgctctgaaatccagacccaaacagaagga
aaggagaccagctctcagaaaaccgccatcgtcatcatggcccattctcaatga

>gi568815587f:73705025_73964367|GENSCAN_predicted_peptide_5|114_aa
MELDEWERTGRDFKKAYKDGAKIPVSVWSVWALIKAAPEPFQTDDEADSDEEEEDECKKL
TSDSEREAQEPEEIIEKKGKLKKTYFTGPSAPPAELSEWPPPPSPTMGEKVKQL

>gi568815587f:73705025_73964367|GENSCAN_predicted_CDS_5|345_bp
atggagttggatgaatgggaaagaactggcagagattttaaaaaggcatataaagatgga
gccaaaattccagtttctgtttggtcagtgtgggcgttaataaaggcagctcctgagcca
tttcaaacagatgatgaggcagattcagatgaagaagaggaggatgagtgtaagaaacta
acatcagattctgaacgcgaggcacaggagccggaggaaatcatagaaaagaaaggaaag
ctgaaaaagacatattttactggtccatcagctccgcctgctgaattaagtgaatggcca
cctcctccctctcccacaatgggtgagaaagtgaagcagctgtaa

>gi568815587f:73705025_73964367|GENSCAN_predicted_peptide_6|239_aa
MGQDYYSVLGITRNSEDAQIKQATWANGPFIKLSTNHPIRSISPPPRVTADTYSQTLRCL
PFRNRPAGTTCINPPSSRYRRLALKHHPLKSNEPSSAEIFRQIAEAYDVLSDPMKRGIYD
KFGEEGLKGGIPLEFGSQTPWTTGYVFHGKPEKVFHEFFGGNNPFSGMKLVGGPTKAAAE
KPSVSTTVPTSSRQSHGEMPAAFFLPPKQLRNYLMNRIQSQNLSCKRVWGMFLLALQIL

>gi568815587f:73705025_73964367|GENSCAN_predicted_CDS_6|720_bp
atgggccaggattattactctgtgctcgggatcactcgcaattcagaggatgcccagatc
aagcaggccacctgggcgaatggtccctttattaagctctccacaaaccacccaatccgc
tccatctctcccccaccccgggtcactgccgatacatattctcagacgctgcgatgtctg
cccttcagaaacaggccagctggtacgacttgtatcaaccctccctcctccaggtaccgc
agactcgcccttaagcaccacccgttgaagtcaaatgagccgtcttcagcagagattttc
aggcaaatagcagaggcctacgacgtgctgagtgaccccatgaagagaggcatctacgac
aagtttggagaagagggcctgaagggtgggattcctttggagtttggatcccagacccca
tggacaactggttacgtcttccatggcaaacctgaaaaggtgttccacgagttctttggt
ggaaacaaccccttcagtggcatgaaactcgtgggtggacccacaaaggctgctgcagaa
aaacctagtgtctccacaacagtgcccaccagctccagacagagccacggagagatgcct
gccgccttcttcctgcctcccaagcagctcaggaactacctgatgaacagaattcaaagc
cagaaccttagctgcaagagagtctgggggatgttccttttggctctacagattctctag