GENSCAN 1.0 Date run: 8-Nov-116 Time: 15:51:15

Sequence gi568815593r:172238372_172554352 : 315981 bp : 46.41% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   8663   8733   71  1  2   75   95    97 0.843   9.62
 1.02 Term +  13371  13503  133  0  1   45   46    73 0.559  -3.64
 1.03 PlyA +  13522  13527    6                               1.05

 2.04 PlyA -  14882  14877    6                               1.05
 2.03 Term -  16689  16474  216  0  0   96   48   103 0.515   4.24
 2.02 Intr -  17158  16869  290  2  2   49   89   189 0.843  12.06
 2.01 Init -  45294  45225   70  0  1   88   92   106 0.685  12.21
 2.00 Prom -  49039  49000   40                              -5.46

 3.17 PlyA -  49046  49041    6                               1.05
 3.16 Term -  87009  86905  105  0  0   77   54    90 0.705   2.91
 3.15 Intr - 101545 100064 1482  1  0  118   34  1273 0.394 114.84
 3.14 Intr - 107890 107765  126  1  0   28  105   238 0.966  20.38
 3.13 Intr - 111941 111844   98  0  2   41   60    62 0.320  -1.57
 3.12 Intr - 112218 111992  227  2  2  122   70   194 0.924  18.63
 3.11 Intr - 115634 115487  148  0  1  119   55   239 0.752  22.99
 3.10 Intr - 120506 120402  105  0  0  112   97    60 0.990   9.49
 3.09 Intr - 124498 124364  135  2  0   93   75   241 0.996  23.84
 3.08 Intr - 127644 127471  174  1  0   98   66    25 0.553   1.21
 3.07 Intr - 135444 135419   26  2  2  118   99    22 0.329   3.97
 3.06 Intr - 143756 143665   92  2  2  127   89   152 0.440  17.99
 3.05 Intr - 156268 156192   77  2  2  136  116   128 0.924  19.63
 3.04 Intr - 159747 159679   69  1  0   79   89    26 0.642   0.95
 3.03 Intr - 166624 166514  111  2  0  111   80    25 0.950   4.35
 3.02 Intr - 167981 167906   76  2  1   54   96    90 0.777   5.49
 3.01 Init - 168719 168630   90  2  0   78   34    72 0.324   1.29
 3.00 Prom - 168807 168768   40                              -5.46

 4.00 Prom + 170708 170747   40                              -2.96
 4.01 Init + 173901 173966   66  2  0  111   67    81 0.936   9.40
 4.02 Intr + 177555 177619   65  2  2  102  121     3 0.156   2.52
 4.03 Intr + 184020 184272  253  0  1   76   50   201 0.011  12.54
 4.04 Intr + 184673 184820  148  1  1  130   55    40 0.001   4.71
 4.05 Term + 201800 201903  104  0  2   62   43   173 0.292   8.64
 4.06 PlyA + 202084 202089    6                              -4.73

 5.00 Prom + 202556 202595   40                              -2.06
 5.01 Init + 204310 204368   59  0  2   68   91    81 0.845   7.18
 5.02 Intr + 206448 206549  102  0  0   65   76    50 0.377   0.79
 5.03 Term + 212863 212962  100  1  1  105   38    61 0.049   0.40
 5.04 PlyA + 214476 214481    6                               1.05

 6.00 Prom + 214541 214580   40                              -6.66
 6.01 Init + 215980 216468  489  0  0  103    1   276 0.076  13.60
 6.02 Intr + 217855 218020  166  0  1   71   67    27 0.199  -1.57
 6.03 Intr + 221447 221628  182  0  2  108   49   109 0.748   8.59
 6.04 Intr + 227531 227571   41  1  2  118   28    33 0.063  -2.68
 6.05 Intr + 228202 228238   37  1  1  120   90    15 0.085   3.16
 6.06 Intr + 241090 241233  144  0  0   32   83   114 0.071   5.78
 6.07 Intr + 243037 243093   57  0  0  114  101    -9 0.048   2.08
 6.08 Intr + 245151 245234   84  2  0   87   25   104 0.048   4.02
 6.09 Intr + 252633 252725   93  2  0  106   65    12 0.058   0.86
 6.10 Intr + 254223 254307   85  2  1   92   72    18 0.369  -0.01
 6.11 Intr + 265857 265889   33  1  0  118   81    48 0.267   5.29
 6.12 Intr + 276433 276557  125  2  2   47  110    57 0.085   4.10
 6.13 Term + 301189 301350  162  0  0   50   42    92 0.018  -1.26
 6.14 PlyA + 301560 301565    6                               1.05

 7.00 Prom + 302535 302574   40                              -1.06
 7.01 Init + 304614 304722  109  2  1  112   97    60 0.414   8.54
 7.02 Term + 306818 306861   44  0  2  106   44    20 0.247  -3.28
 7.03 PlyA + 307759 307764    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr - 184125 184045   81  0  0  141  110   187 0.982  26.03
S.002 Intr - 241321 241265   57  1  0  121   97    36 0.815   6.88
S.003 Intr - 243426 243373   54  0  0  115   98     3 0.829   3.18

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593r:172238372_172554352|GENSCAN_predicted_peptide_1|67_aa
MSHSIRPKIAVLKIRLHAIYKKSLPVLQEMLKVILEAERKDVNEKHKTSQSIKLVKVKFR
IQMVPEV

>gi568815593r:172238372_172554352|GENSCAN_predicted_CDS_1|204_bp
atgagccacagcatccggcccaagattgcagttttaaagattcgactacatgctatctac
aagaaatccctacctgtcttacaagaaatgctaaaggtaattcttgaagctgaaagaaag
gacgtcaatgagaaacacaaaacatctcaaagtataaaactggtgaaagtaaaattcaga
atacagatggttcctgaagtatag

>gi568815593r:172238372_172554352|GENSCAN_predicted_peptide_2|191_aa
MGGCVGAQHDSSGSLNENSEGTGGSPDLGLYFSWAASLTTPLKAKHQSTLEGLKECSCLT
QFLLSKRPVDPVSVSYSSNYMESMKPNKYGVIYSTQLPDEFFQTLEGLWHGIQMEPVDFM
MMAAALSWHRIWSLGILLVIQPVVVQPIPFMYMSHLQEPLMVSLRRRRRKRSSMEEMKNS
SSSMQVPVIES

>gi568815593r:172238372_172554352|GENSCAN_predicted_CDS_2|576_bp
atgggcgggtgtgtgggcgcccagcacgactcctcgggcagcctcaacgagaactcggag
ggcaccggaggcagcccggatctgggcctttatttctcgtgggcggcgtccctaaccaca
cctctcaaagccaaacaccagagcaccctagaaggtttaaaagaatgctcatgtttgacc
cagttcctgttaagcaagaggcccgtggaccctgtctcggtgtcatactcatctaattac
atggaatccatgaagcccaacaagtatggggtcatctactccacacaattgcctgatgag
ttctttcagaccctagaaggcctgtggcatggaatacagatggagccagtggacttcatg
atgatggcagctgccctctcctggcacagaatatggagcctggggatcttgctggtcatc
cagccggtggtggtgcagcccatcccctttatgtacatgagtcacctccaggagcctctc
atggtctccttgaggaggaggaggaggaagaggtcttccatggaggagatgaaaaattcc
agtagtagcatgcaagtacctgtaattgaatcataa

>gi568815593r:172238372_172554352|GENSCAN_predicted_peptide_3|1046_aa
MGSKKQILVILYAKSDISHEVPEAFRDTFMMQMLDKFPMEGGQKDPKQRIIPFLPEQPCG
IGITAPISQMRKIEAQGTYGRLHRWQNWNSDTETQENSRGKEKVPHFQVVTVQKPSKILF
RRSHIRDVAVKRLIPIDEYCKALIQLPPYISQCDEVLQFFETRPEDLNPPKEEHIGKKKS
VLSVKLRHRETHLPGPAHLRAQTEGKLQSWGFEPSGRLYPGVDAGLDSEKSESSDQDTGG
DQTSVDPMVLEQYVVVANYQKQESSEISLSVGQVVDIIEKNESGWWFVSTAEEQGWVPAT
CLEGQDGVQDEFSLQPEEEEKYTVIYPYTARDQDEMNLERGAVVEVIQKNLEGWWKIRYQ
PLPPTVAGYQGKEGWAPASYLKKNSGEPLPPKPGPGSPSHPGALDLDGVSRQQNAVGREK
ELLSSQRDGRFEGRPVPDGDAKQTLCSRRLTSSLILTAAQLLPHPPDKKTEAQKPRPRGL
NLPKPPIPPQVEEEYYTIAEFQTTIPDGISFQAGLKVEVIEKNLSGWWYIQIEDKEGWAP
ATFIDKYKKTSNASRPNFLAPLPHEVTQLRLGEAAALENNTGSEATGPSRPLPDAPHGVM
DSGLPWSKDWKGSKDVLRKASSDMSASAGYEEISDPDMEEKPSLPPRKESIIKSEGELLE
RERERQRTEQLRGPTPKPPGVILPMMPAKHIPPARDSRRPEPKPDKSRLFQLKNDMGLEC
GHKVLAKEVKKPNLRPISKSKTDLPEEKPDATPQNPFLKSRPQVRPKPAPSPKTEPPQGE
DQVDICNLRSKLRPAKSQDKSLLDGEGPQAVGGQDVAFSRSFLPGEGPGRAQDRTGKQDG
LSPKEISCRAPPRPAKTTDPVSKSVPVPLQEAPQQRPVVPPRRPPPPKKTSSSSRPLPEV
RGPQCEGHESRAAPTPGRALLVPPKAKPFLSNSLGGQDDTRGKGSLGPWGTGKIGENREK
AAAASVPNADGLKDSLYVAVADFEGDKDTSSFQEGTVFEVREKNSSGWWFCQDHKNVHLE
SPVEVPLRRIKMRTALGKHVAASMEM

>gi568815593r:172238372_172554352|GENSCAN_predicted_CDS_3|3141_bp
atggggagtaagaagcagatcctcgtgattttgtatgccaaatccgatatttcacatgaa
gtccctgaagctttcagagatacgtttatgatgcagatgttggacaaatttcccatggaa
ggaggacagaaggaccccaagcagcggatcatcccctttctgccagagcagccctgcggc
ataggcattactgcacccatttcacagatgaggaaaattgaggctcagggaacctatggg
cgcttgcacagatggcagaactggaattcagacacagaaacacaggagaattccaggggg
aaggagaaggttccacatttccaagttgtaactgttcagaaaccaagtaagattctcttc
agacgaagccacatccgggacgtggctgtcaaacgcctgataccaattgatgaatactgt
aaggccctcatccagctgcccccctacatctctcagtgtgatgaggtgctgcagttcttt
gagacaagacctgaggacctgaatccccccaaagaggagcacattgggaaaaagaaatct
gttctttctgtcaagctgaggcacagggaaacacacttacctggtccagctcacctgagg
gcccagacagaagggaaactgcaatcctggggttttgagccctcgggcaggctgtatcct
ggggtagatgctggattggattcagagaagagtgagtctagtgaccaggacacagggggt
gaccaaacctcagtggaccccatggtcctggagcagtatgtggtggtagccaactaccag
aagcaggagagttcggagatcagcctcagcgtggggcaggtggtggacatcatcgagaag
aatgagtcaggttggtggttcgtcagcactgccgaggagcaaggctgggtccctgcaacg
tgcctcgaaggccaggatggggtgcaggatgagttttctctgcagcctgaagaagaggag
aagtacacagtcatctacccgtacacagctcgggaccaggatgaaatgaacctggagaga
ggggctgtggtggaggtcatccagaaaaacctggaaggctggtggaagatcaggtaccag
ccgctgccccccacggttgctgggtaccagggcaaagaaggctgggcccccgcctcctac
ctaaagaagaacagtggggagcccttgcccccgaagccaggccctggctcaccctcccac
ccgggtgcccttgacttggatggtgtttcccggcagcagaacgcggtgggcagggagaag
gagctgctcagcagccagagggacgggcggtttgaaggccgcccggtgcccgacggtgac
gccaagcagactctgtgctcaaggcgtctcacatcgtccctcatcctcacagcagcccag
ctgctcccccatcctccagataagaaaacagaggcccagaaaccccggcctcgaggcctc
aacctgccgaagccgcccatcccgccccaagtggaggaagagtattacaccatcgccgaa
ttccagacaaccatcccagacggcatcagcttccaggcaggcctgaaggtcgaggtgatc
gagaaaaacttgagtggctggtggtacattcagattgaagataaggaagggtgggccccg
gccaccttcattgacaagtacaagaagacgagcaacgcgtcgagacccaactttctggct
cccctgccccacgaggtgacccagctccggctgggggaagcagcagcgctggagaacaac
acgggcagcgaagccacgggcccctcccggcccctgcctgacgcaccgcatggtgtcatg
gactcggggttgccatggtctaaagactggaagggcagtaaggatgtcctgaggaaggca
tcttcagacatgtctgcgtcagcaggctacgaggagatctcagaccccgacatggaggag
aagcccagcctccctccgcggaaagaatccatcatcaagtcggagggggagctgctggag
cgggagcgggagcggcagaggacggagcagctccggggccccactcccaagcctccgggc
gtgattttgccgatgatgccagccaaacacatccctccagcccgggacagcaggaggcca
gagcccaaacctgacaaaagcagactgttccagctgaaaaatgacatggggctggagtgt
ggccacaaggtcttggccaaggaagtgaagaagcccaacctccggcccatctccaaatcc
aaaactgacctgccagaggagaagccagatgccactccccagaatcccttcttgaagtcc
agacctcaggttaggccaaaaccagctccttcccccaaaacggagccacctcagggcgaa
gaccaagtcgacatctgcaacctcaggagtaagctcaggcctgccaagtcccaagacaag
tccttgttggatggggagggcccccaggcagtagggggccaagacgtggccttcagccga
agcttcctcccaggagaggggcctggccgcgcccaggacaggacgggcaaacaggatggt
ctcagcccaaaagagatttcctgcagagcccctccgaggccagccaagaccacagatcct
gtgtctaagagcgtgcctgttcctctccaagaggctccccagcagagacctgtggtccca
ccccgcagaccacctcccccaaagaaaacctcttcgtcatccaggccgctcccagaggtc
agaggtccacagtgtgaaggccacgaaagcagggcagctcccaccccaggccgtgctctc
ctcgtccctccaaaagccaaaccttttctctccaactctttggggggccaggatgacacg
cgaggcaaaggcagcctggggccatgggggaccggcaagattggagaaaacagggagaaa
gcagctgcagcctctgtccccaatgccgacggcctgaaggactctttgtatgtggccgtg
gccgactttgaaggagacaaagacaccagcagcttccaggaagggacagtgtttgaagtc
cgggagaagaacagcagtggctggtggttctgccaggatcataagaatgttcacttggag
agccctgtggaagtccctcttagaagaatcaagatgcggacagcacttgggaagcacgtg
gcagcttccatggagatgtga

>gi568815593r:172238372_172554352|GENSCAN_predicted_peptide_4|211_aa
MEVFYILLSNPVATSLLVAIEQVNETCGQMSSKSIRHHYQALLSSPETSNPYLEVKKLAV
APVNGLGGAAGPRDPDDVDLREQQRRWVLTPEGGLPISGTQGEHKTQSQKDVGFSLSSAN
DWLDKLLADRCEVLMKLRMKDHFINQWAVLPTCVTRHGTHRKQYIDNKPGQTEVTIRELS
RVNGAANKSTHYAQGSSIKAVFKEHAVEMAK

>gi568815593r:172238372_172554352|GENSCAN_predicted_CDS_4|636_bp
atggaggtgttctacatcttgctgtccaatccagtggccactagcctcctggtagctatt
gagcaggtgaatgagacgtgtggtcagatgtcaagtaagagcatcaggcatcactatcag
gcccttctaagctcaccagagacctcaaatccttacctggaggtcaaaaaacttgctgta
gcgccggtaaatggcctcggtggagccgctggaccacgtgacccggatgatgtagacctg
cgggagcaacagaggagatgggtgttaacaccagaaggtggtctcccaatctctgggacc
cagggggagcacaagactcagagtcagaaagacgtgggtttcagccttagctctgccaat
gactggctggacaagttgcttgctgatagatgtgaggtgctgatgaaattaaggatgaag
gaccattttataaaccagtgggctgtgcttccaacctgtgtcacacgtcacggcacgcac
agaaaacaatacattgataacaagcccgggcaaactgaggtgactattagagagctgtcc
agagtgaatggagccgccaacaagagcacccactacgcgcaaggcagtagcatcaaagct
gtcttcaaggagcacgcggtcgagatggcaaagtag

>gi568815593r:172238372_172554352|GENSCAN_predicted_peptide_5|86_aa
MHITSQINDDDDDGKDNNNSDCLWPRALASAFVCCTGAMAFLGGDHAFSPVVLRQVSDEQ
VVAEMAANGERGSGKLGGVVKRTLKN

>gi568815593r:172238372_172554352|GENSCAN_predicted_CDS_5|261_bp
atgcacatcactagtcagataaatgatgatgatgatgatggtaaagataataataacagt
gactgcctctggccccgagccctggcctcagccttcgtgtgctgtactggagctatggcc
ttcctggggggggaccatgccttctctcctgtggttctcaggcaagtgagtgatgagcag
gtagtggccgagatggctgcaaatggggagaggggtagtgggaaattagggggcgtagtg
aagaggactctcaaaaactga

>gi568815593r:172238372_172554352|GENSCAN_predicted_peptide_6|565_aa
MAAPPPAAGRARVRVARGAGSAGGAPPRAGNLELSAIAAGAEHEPPPPPPPPFARRALQP
GPAAAARPRPAPRTAPRTAPRTAPRAAGPPPLKGPAPTPAHRARTPPATSGPRPAFVPVL
SRRAGGGDWTDPVARSLIDVSIHSSTLAPAAAIHRQFGEPDTRDSDSLAVASPGYYAILG
ASLAPPTPVYIDPLLNCLHGPSLRVASVRCQDPHSCNSFSGKKLGLERDSLRDCCLIGWH
CGVVITNVATLSNAQWPGTRGPEKQTTGYKKAGLMAKFQIKKGVAFKTTARRKYRQPAQV
RAKSEGSRLEAVWHLKTLELLLSGTHALPIHLPSAPGAPADAEDAGNDSAPALIGPSSCR
KTSSGLPLLYTMVVGDDSPHDDKMAATAPDKSAYDIQRGVVCNETNHPASLGGLGEGMRQ
DVKRPTRWLSLVRSFQISWSPSLLKNHWRNEDSQPVEGKGWGCPEEKQTQGRFTGVTFTQ
YITSTFQQEITRHTKKQKHILKRLDKHQDRSQGIDSHLRNRNPHKTLNLIKHKPSDYSSG
NVAGVGEELHSNSSILAIDTKSLKL

>gi568815593r:172238372_172554352|GENSCAN_predicted_CDS_6|1698_bp
atggccgctcctccgcccgcagcgggccgagcgcgggtgcgggtggcgcggggtgcaggg
agcgctgggggcgcgccgccgcgggcagggaacctggagctgagcgcaatcgcagccggg
gccgagcacgagccgccgccgccaccgccgccgcccttcgctcggcgcgcactccagccg
ggcccagccgccgccgcccgcccccggcccgccccccgcaccgccccccgcaccgccccc
cgcaccgccccccgcgccgccggcccgccccccttaaagggcccggccccgacccccgcc
caccgcgcgcgaaccccgccggccacgtccggcccccgcccggcctttgtcccggtgctg
agccgccgcgccgggggaggggactggactgaccccgtcgcccgttcactgattgacgta
tcgattcattccagcactctcgctccggcagccgccatccaccgccagtttggagagccg
gacacccgggacagtgacagtcttgctgttgcaagccctgggtactacgccatcctcggt
gcttccttagccccgcccacacccgtatacattgaccctttactaaactgtcttcatgga
cccagcttgagggtggcatctgttcggtgccaggaccctcactcatgtaattccttctct
gggaagaagctgggcttggaaagagattccctccgtgactgctgtctgattggctggcat
tgcggggtggtcataactaatgtggccacgctttctaatgctcagtggcccggcacacgt
ggtcctgagaaacagaccacaggctataagaaagcaggccttatggcaaaattccagatc
aaaaaaggggtggccttcaagacaactgctcgtagaaagtacaggcagccagctcaggtc
cgagcaaaaagtgagggaagccgtctggaggcagtgtggcacctgaagacgctggagctg
ctgctgtcagggacacacgccctgcccattcatctgccttctgcgccgggagcccccgca
gatgcggaagatgcagggaatgattcagccccagccttaataggaccgtctagttgcagg
aaaacaagctcagggctcccactactctacacgatggttgttggtgatgactcccctcat
gatgacaaaatggctgccacagctccagacaaatctgcatatgacatccagagaggggtg
gtgtgtaatgagaccaaccaccctgcttctctagggggcttgggggaaggaatgagacaa
gatgtgaagcgtccaacacggtggctgtccctggtcagatcattccagatttcctggtct
ccttccctcctgaagaaccactggaggaacgaggactcccagcctgtggaaggaaagggc
tggggctgcccagaggagaagcagacgcaaggccgatttactggagttacttttacccag
tacatcacatccacctttcaacaggaaattacaagacatactaaaaagcaaaaacacatt
ttgaagagactggacaaacatcaggacaggagtcagggaatagacagccatctacgaaat
aggaaccctcataaaacactgaatctcataaaacacaaaccttcagactatagcagtggc
aatgttgctggggttggtgaagagctgcactccaattcaagcattttggcaatagatacc
aagagccttaaattgtga

>gi568815593r:172238372_172554352|GENSCAN_predicted_peptide_7|50_aa
MQRAGPQRTSPHMPCGGCETLSLRSELKAGSCQQLQESLSVVTEALILRT

>gi568815593r:172238372_172554352|GENSCAN_predicted_CDS_7|153_bp
atgcagcgagcaggtccccagagaacctctccccacatgccgtgtgggggttgcgagacc
ctgtccctgcggagtgagctgaaggccgggagctgccagcagctgcaagaaagcctctct
gtggttacagaggctctcatcttgaggacctag