GENSCAN 1.0 Date run: 6-Nov-116 Time: 11:06:31 Sequence gi568815584f:103429979_103634818 : 204840 bp : 49.46% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 13062 13293 232 2 1 68 74 66 0.145 1.62 1.02 Intr + 21940 22005 66 2 0 120 94 61 0.935 8.78 1.03 Intr + 27164 27234 71 0 2 78 80 16 0.712 -1.30 1.04 Intr + 32427 32483 57 2 0 122 113 36 0.911 8.58 1.05 Intr + 35579 35743 165 1 0 98 -5 170 0.305 8.86 1.06 Intr + 35994 36113 120 2 0 94 74 73 0.933 7.19 1.07 Intr + 38055 38208 154 2 1 82 115 6 0.832 2.35 1.08 Intr + 45015 45232 218 1 2 33 84 108 0.549 3.12 1.09 Intr + 50409 50512 104 2 2 51 91 49 0.227 0.47 1.10 Intr + 61799 62056 258 2 0 46 107 101 0.187 4.38 1.11 Term + 72904 73249 346 1 1 65 40 566 0.075 43.47 1.12 PlyA + 73993 73998 6 1.05 2.09 PlyA - 74253 74248 6 1.05 2.08 Term - 90064 89886 179 2 2 104 48 411 0.999 36.55 2.07 Intr - 90333 90144 190 0 1 134 99 292 0.999 34.06 2.06 Intr - 90614 90491 124 1 1 39 50 266 0.998 18.49 2.05 Intr - 91456 91285 172 2 1 40 78 469 0.997 40.10 2.04 Intr - 91972 91840 133 1 1 103 97 257 0.999 28.32 2.03 Intr - 92199 92045 155 1 2 100 80 463 0.999 46.49 2.02 Intr - 92527 92323 205 1 1 115 75 553 0.999 55.47 2.01 Init - 92676 92557 120 0 0 98 -13 192 0.886 8.29 2.00 Prom - 95282 95243 40 -7.76 3.00 Prom + 95877 95916 40 -11.72 3.01 Init + 96264 96360 97 2 1 80 76 123 0.722 10.77 3.02 Intr + 99972 100331 360 1 0 98 105 454 0.748 43.29 3.03 Intr + 102604 102870 267 2 0 101 52 439 0.509 39.10 3.04 Term + 104572 104843 272 2 2 141 50 458 0.999 42.95 3.05 PlyA + 106047 106052 6 1.05 4.00 Prom + 110996 111035 40 -6.96 4.01 Init + 111471 111588 118 2 1 76 49 78 0.280 1.07 4.02 Intr + 113854 113940 87 2 0 96 71 66 0.540 5.64 4.03 Intr + 118282 118620 339 2 0 93 103 118 0.146 9.25 4.04 Term + 123043 123245 203 2 2 61 47 144 0.963 5.15 4.05 PlyA + 124504 124509 6 1.05 5.03 PlyA - 125142 125137 6 1.05 5.02 Term - 131214 129843 1372 2 1 94 44 1137 0.996 100.73 5.01 Init - 133025 132889 137 2 2 114 60 203 0.985 17.71 5.00 Prom - 138002 137963 40 -4.86 6.00 Prom + 138456 138495 40 -5.26 6.01 Init + 145105 145189 85 0 1 77 31 100 0.052 2.87 6.02 Intr + 157296 157386 91 1 1 58 96 70 0.122 3.85 6.03 Term + 160203 160308 106 0 1 111 49 162 0.999 12.48 6.04 PlyA + 160530 160535 6 1.05 7.05 PlyA - 161083 161078 6 1.05 7.04 Term - 170808 170672 137 1 2 50 44 94 0.459 -0.62 7.03 Intr - 175054 174970 85 1 1 87 111 37 0.807 5.29 7.02 Intr - 177715 177553 163 0 1 91 41 139 0.790 9.48 7.01 Init - 184941 184934 8 0 2 114 91 0 0.449 3.40 7.00 Prom - 185785 185746 40 -7.96 8.00 Prom + 187970 188009 40 -1.36 8.01 Init + 191087 191122 36 1 0 16 105 52 0.525 -0.29 8.02 Term + 195323 195445 123 1 0 140 42 83 0.865 7.28 8.03 PlyA + 197379 197384 6 1.05 9.00 Prom + 197567 197606 40 -7.86 9.01 Init + 199159 199516 358 0 1 93 101 221 0.865 19.17 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 70185 70076 110 1 2 82 47 115 0.916 5.47 S.002 Sngl + 72893 73249 357 1 0 42 40 576 0.917 44.26 S.003 Init + 157325 157386 62 1 2 84 96 33 0.853 4.42 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815584f:103429979_103634818|GENSCAN_predicted_peptide_1|596_aa MRGKYQILKNLNYYKGTFSATLKNVRISKEIDNFLGKHDLPKLTLEKNRYTSVTTEVEKV VNILPNLEFMIEFFEIYLKLFEVIETEKTLYLIMEYASGGEVFDYLVAHGRMKEKEARSK FRQIVSAVQYCHQKRIVHRDLKAENLLLDADMNIKIADFGFSNEFTVGGKLDTFCGSPPY AAPELFQGKKYDGPEVDELRERVLRGKYRIPFYMSTDCENLLKRFLVLNPIKRGTLELDA SDSSSSSNLSLAKVRPSSDLNNSTGQSPHHKVQRSVSSSQKQRRYSDHAGPAIPSVVAYP KRSQTSTADSDLKEDGISSRKSSGSAVGGKGIAPASPMLGNASNPNKADIPERKKSSTVP SSNTASGGMTRRNTYVCSERTTADRHSVIQNGKENSTIPDQRTPVASTHSISSAATPDRI RFPRGTASRSTFHGQPRERRTATYNGPPASPSLSHEATPLSQTRSRGSTNLFSKLTSKLT RSRNVSAEQKDENKEAKPRSLRFTWSMKTTSSMDPGDMMREIRKVLDANNCDYEQRERFL LFCVHGDGHAENLVQWEMEVCKLPRLSLNGVRFKRISGTSIAFKNIASKIANELKL >gi568815584f:103429979_103634818|GENSCAN_predicted_CDS_1|1791_bp atgagggggaaataccagatactgaagaatttaaattattataaaggaaccttttctgca actctgaaaaatgttagaatatccaaagaaattgataattttctaggaaaacatgactta ccaaaattaactctagaaaagaatcgatacacatcagtaacaacagaagttgagaaagta gttaacatattgccaaacctggaattcatgattgaattctttgagatctacttgaagtta ttcgaagtcattgaaactgaaaaaacactctacctaatcatggaatatgcaagtggaggt gaagtatttgactatttggttgcacatggcaggatgaaggaaaaagaagcaagatctaaa tttagacagattgtgtctgcagttcaatactgccatcagaaacggatcgtacatcgagac ctcaaggctgaaaatctattgttagatgccgatatgaacattaaaatagcagatttcggt tttagcaatgaatttactgttggcggtaaactcgacacgttttgtggcagtcctccatac gcagcacctgagctcttccagggcaagaaatatgacgggccagaagtggatgaactgaga gagagagtattaagagggaaatacagaattcccttctacatgtctacagactgtgaaaac cttctcaaacgtttcctggtgctaaatccaattaaacgcggcactctagagctggatgct agtgattccagttctagcagcaatctttcacttgctaaggttaggccgagcagtgatctc aacaacagtactggccagtctcctcaccacaaagtgcagagaagtgtttcttcaagccaa aagcaaagacgctacagtgaccatgctggaccagctattccttctgttgtggcgtatccg aaaaggagtcagaccagcactgcagatagtgacctcaaagaagatggaatttcctcccgg aaatcaagtggcagtgctgttggaggaaagggaattgctccagccagtcccatgcttggg aatgcaagtaatcctaataaggcggatattcctgaacgcaagaaaagctccactgtccct agtagtaacacagcatctggtggaatgacacgacgaaatacttatgtttgcagtgagaga actacagctgatagacactcagtgattcagaatggcaaagaaaacagcactattcctgat cagagaactccagttgcttcaacacacagtatcagtagtgcagccaccccagatcgaatc cgcttcccaagaggcactgccagtcgtagcactttccacggccagccccgggaacggcga accgcaacatataatggccctcctgcctctcccagcctgtcccatgaagccacaccattg tcccagactcgaagccgaggctccactaatctctttagtaaattaacttcaaaactcaca aggagtcgcaatgtatctgctgagcaaaaagatgaaaacaaagaagcaaagcctcgatcc ctacgcttcacctggagcatgaaaaccactagttcaatggatcccggggacatgatgcgg gaaatccgcaaagtgttggacgccaataactgcgactatgagcagagggagcgcttcttg ctcttctgcgtccacggagatgggcacgcggagaacctcgtgcagtgggaaatggaagtg tgcaagctgccaagactgtctctgaacggggtccggtttaagcggatatcggggacatcc atagccttcaaaaatattgcttccaaaattgccaatgagctaaagctgtaa >gi568815584f:103429979_103634818|GENSCAN_predicted_peptide_2|425_aa MALPARAADPADLGRVPGGAGGPGGGGLSGTREPGNPGVPPAAAMPFSNSHNALKLRFPA EDEFPDLSAHNNHMAKVLTPELYAELRAKSTPSGFTLDDVIQTGVDNPGHPYIMTVGCVA GDEESYEVFKDLFDPIIEDRHGGYKPSDEHKTDLNPDNLQGGDDLDPNYVLSSRVRTGRS IRGFCLPPHCSRGERRAIEKLAVEALSSLDGDLAGRYYALKSMTEAEQQQLIDDHFLFDK PVSPLLLASGMARDWPDARGIWHNDNKTFLVWVNEEDHLRVISMQKGGNMKEVFTRFCTG LTQIETLFKSKDYEFMWNPHLGYILTCPSNLGTGLRAGVHIKLPNLGKHEKFSEVLKRLR LQKRGTGGVDTAAVGGVFDVSNADRLGFSEVELVQMVVDGVKLLIEMEQRLEQGQAIDDL MPAQK >gi568815584f:103429979_103634818|GENSCAN_predicted_CDS_2|1278_bp atggcgctccccgcgcgcgctgcggaccccgctgaccttggccgcgtcccggggggcgcc ggggggcccggcggcgggggcctgagtggtacgcgggagcccgggaaccccggcgtgccg cccgccgccgccatgcccttctccaacagccacaacgcactgaagctgcgcttcccggcc gaggacgagttccccgacctgagcgcccacaacaaccacatggccaaggtgctgaccccc gagctgtacgcggagctgcgcgccaagagcacgccgagcggcttcacgctggacgacgtc atccagacaggcgtggacaacccgggccacccgtacatcatgaccgtgggctgcgtggcg ggcgacgaggagtcctacgaagtgttcaaggatctcttcgaccccatcatcgaggaccgg cacggcggctacaagcccagcgatgagcacaagaccgacctcaaccccgacaacctgcag ggcggcgacgacctggaccccaactacgtgctgagctcgcgggtgcgcacgggccgcagc atccgtggcttctgcctccccccgcactgcagccgcggggagcgccgcgccatcgagaag ctcgcggtggaagccctgtccagcctggacggcgacctggcgggccgatactacgcgctc aagagcatgacggaggcggagcagcagcagctcatcgacgaccacttcctcttcgacaag cccgtgtcgcccctgctgctggcctcgggcatggcccgcgactggcccgacgcccgcggt atctggcacaatgacaataagaccttcctggtgtgggtcaacgaggaggaccacctgcgg gtcatctccatgcagaaggggggcaacatgaaggaggtgttcacccgcttctgcaccggc ctcacccagattgaaactctcttcaagtctaaggactatgagttcatgtggaaccctcac ctgggctacatcctcacctgcccatccaacctgggcaccgggctgcgggcaggtgtgcat atcaagctgcccaacctgggcaagcatgagaagttctcggaggtgcttaagcggctgcga cttcagaagcgaggcacaggcggtgtggacacggctgcggtgggcggggtcttcgacgtc tccaacgctgaccgcctgggcttctcagaggtggagctggtgcagatggtggtggacgga gtgaagctgctcatcgagatggagcagcggctggagcagggccaggccatcgacgacctc atgcctgcccagaaatga >gi568815584f:103429979_103634818|GENSCAN_predicted_peptide_3|331_aa MTVYGGQAAVTGTAAADNSGEVALKAFRPKCQGPWPLPDISTMSFVAYEELIKEGDTAIL SLGHGAMVAVRVQRGAQTQTRHGVLRHSVDLIGRPFGSKVTCGRGGWVYVLHPTPELWTL NLPHRTQILYSTDIALITMMLELRPGSVVCESGTGSGSVSHAIIRTIAPTGHLHTVEFHQ QRAEKAREEFQEHRVGRWVTVRTQDVCRSGFGVSHVADAVFLDIPSPWEAVGHAWDALKV EGGRFCSFSPCIEQVQRTCQALAARGFSELSTLEVLPQVYNVRTVSLPPPDLGTGTDGPA GSDTSPFRSGTPMKEAVGHTGYLTFATKTPG >gi568815584f:103429979_103634818|GENSCAN_predicted_CDS_3|996_bp atgacagtgtacgggggccaggcagctgtgacggggacagcagcagcagataacagtggg gaggtggccctcaaggccttcagacctaaatgtcagggtccttggcccttgccagacatt agcaccatgagcttcgtggcatacgaggagctgatcaaggagggtgacacggccatcctg tcactgggccatggtgcaatggtggcggtgcgtgtgcagcgtggggcacagacccagacc cggcatggtgtcctgcggcactcagttgaccttatcggccgccccttcggctccaaggtg acgtgcggccgaggtggctgggtgtatgtgctgcaccccacgcccgagctctggacgctg aacctgccgcaccgcacgcagatcctctactccacagacatcgccctcatcaccatgatg ttggagcttcggcccggctctgtggtctgtgagtctggcaccggcagtggctctgtgtcc cacgccatcatccgcaccattgcacccacgggtcacctgcacacggtggagttccaccag cagcgggcagagaaggcccgggaggagttccaggagcaccgtgtgggccgctgggtgact gtgcgcacccaggacgtgtgccgcagtggctttggcgtgagccacgtggccgacgccgtc ttcctggacatcccatcaccctgggaggccgtgggccacgcctgggacgccctcaaggtc gaaggcgggcgcttctgctccttctcaccgtgcatcgagcaggtgcaacgcacatgccag gcgctggcagcgcgcggcttctcagagctgagcaccctggaggtgctgccacaggtctac aacgtgcgcactgtcagcctgccaccgcccgacctgggcacaggcacagatggccctgcc ggctccgacaccagccccttccgcagcggcacgcccatgaaggaggccgtgggccacacc ggctacctgaccttcgccaccaagaccccaggctag >gi568815584f:103429979_103634818|GENSCAN_predicted_peptide_4|248_aa MAPAAPSSGLLTSSWPLSRGRRHSPPLLPLTVPAWLVPPGTGHVLHPQGAEHIRPDHVGF LQEEAAGPGALSKASKPPLACGAQEALAVLALSPTRCLKFFQGLQCARPWLWALLLKSLH FPLHPHVQRNAGCARGAELEGPPGESEHPSFLISFWVSLSSLAISKGMPLHPKALPLCQA PAGTRDGEAVCGTESLPNGVTDPWQLRKDQRIRSGEIASLHFNLTIERYIFAVVIHKWKT YGQNKALV >gi568815584f:103429979_103634818|GENSCAN_predicted_CDS_4|747_bp atggctcctgcagccccgagctctggcctgctcacctcttcgtggcctctgtctcggggc cgccggcactccccgcctctcctgcctctgactgtgcctgcctggctggtgcccccaggc actgggcacgtgctccatccccaaggcgctgagcacatcaggccagaccacgtgggcttc ctgcaggaggaggcagcaggcccaggggcactcagcaaagcctccaaacctcccttggct tgtggggcccaggaagccctggcagtcctggcgctgtcccctacccgctgcctgaagttc ttccagggcctccagtgtgcccggccctggctctgggctctgcttttgaaaagtcttcat tttcctttacatccccacgtgcagagaaatgccggctgcgcccgaggagcagagttggag gggcccccgggagaatctgaacatcccagtttcctgatctctttctgggtctcactttca tctctggccatcagcaagggcatgcccctccacccgaaggccctgcctctgtgccaggct ccagcggggaccagagatggagaggcagtgtgcggcactgagtccttaccaaatggggtc acagacccctggcagctccggaaagaccaaagaatccgaagcggggaaatagcttcactt cacttcaaccttactatagaaagatacatttttgcagttgtaattcacaagtggaagacg tacggacagaataaagcactcgtttaa >gi568815584f:103429979_103634818|GENSCAN_predicted_peptide_5|502_aa MAPRPLAPAAHGSIATARGAALRQGREARGSAAARPTVRFPSGATGACETEHNKSMDMGN QHPSISRLQEIQKEVKSVEQQVIGFSGLSDDKNYKKLERILTKQLFEIDSVDTEGKGDIQ QARKRAAQETERLLKELEQNANHPHRIEIQNIFEEAQSLVREKIVPFYNGGNCVTDEFEE GIQDIILRLTHVKTGGKISLRKARYHTLTKICAVQEIIEDCMKKQPSLPLSEDAHPSVAK INFVMCEVNKARGVLIALLMGVNNNETCRHLSCVLSGLIADLDALDVCGRTEIRNYRREV VEDINKLLKYLDLEEEADTTKAFDLRQNHSILKIEKVLKRMREIKNELLQAQNPSELYLS SKTELQGLIGQLDEVSLEKNPCIREARRRAVIEVQTLITYIDLKEALEKRKLFACEEHPS HKAVWNVLGNLSEIQGEVLSFDGNRTDKNYIRLEELLTKQLLALDAVDPQGEEKCKAARK QAVRLAQNILSYLDLKSDEWEY >gi568815584f:103429979_103634818|GENSCAN_predicted_CDS_5|1509_bp atggccccacgccccctggctcccgcggcgcacggcagcattgcgacggcgcgaggggcc gctttacggcagggtcgcgaagcccgaggaagcgcggcggcgcggccgaccgtgcgcttt cccagcggtgcgacgggtgcttgtgaaactgaacacaacaaaagtatggatatgggaaac caacatccttctattagtaggcttcaggaaatccaaaaggaagtaaaaagtgtagaacag caagttatcggcttcagtggtctgtcagatgacaagaattacaagaaactggagaggatt ctaacaaaacagctttttgaaatagactctgtagatactgaaggaaaaggagatattcag caagctaggaagcgggcagcacaggagacagaacgtcttctcaaagagttggagcagaat gcaaaccacccacaccggattgaaatacagaacatttttgaggaagcccagtccctcgtg agagagaaaattgtgccattttataatggaggcaactgcgtaactgatgagtttgaagaa ggcatccaagatatcattctgaggctgacacatgttaaaactggaggaaaaatctccttg cggaaagcaaggtatcacactttaaccaaaatctgtgcggtgcaagagataatcgaagac tgcatgaaaaagcagccttccctgccgctttccgaggatgcacatccttccgttgccaaa atcaacttcgtgatgtgtgaggtgaacaaggcccgaggggtcctgattgcacttctgatg ggtgtgaacaacaatgagacctgcaggcacttatcctgtgtgctctcggggctgatcgct gacctggatgctctagatgtgtgcggccggacagaaatcagaaattatcggagggaggta gtagaagatatcaacaaattattgaaatatctggatttggaagaggaagcagacacaact aaagcatttgacctgagacagaatcattccattttaaaaatagaaaaggtcctcaagaga atgagagaaataaaaaatgaacttctccaagcacaaaacccttctgaattgtacctgagc tccaaaacagaattgcagggtttaattggacagttggatgaggtaagtcttgaaaaaaac ccctgcatccgggaagccaggagaagagcagtgatcgaggtgcaaactctgatcacatat attgacttgaaggaggcccttgagaaaagaaagctgtttgcttgtgaggagcacccatcc cataaagccgtctggaacgtccttggaaacttgtctgagatccagggagaagttctttca tttgatggaaatcgaaccgataagaactacatccggctggaagagctgctcaccaagcag ctgctagccctggatgctgttgatccgcagggagaagagaagtgtaaggctgccaggaaa caagctgtgaggcttgcgcagaatattctcagctatctcgacctgaaatctgatgaatgg gagtactga >gi568815584f:103429979_103634818|GENSCAN_predicted_peptide_6|93_aa MAVSAFHADLAVYACVCTHVCAYVHVCTCQKATLNAEEMADFYKEFLSKNFQKHMYYNRD WYKRNFAITFFMGKVALERIWNKLKQKQKKRSN >gi568815584f:103429979_103634818|GENSCAN_predicted_CDS_6|282_bp atggctgtcagcgcattccatgctgacctggccgtgtatgcgtgcgtgtgtacgcatgtg tgtgcgtacgtgcacgtgtgtacatgtcagaaagcaacattgaatgcagaagaaatggcg gacttctacaaggaatttttaagtaaaaattttcagaagcacatgtattataacagagat tggtacaagcgcaattttgccatcaccttcttcatgggaaaagtggccctggaaaggatt tggaacaagcttaaacagaaacaaaagaagaggagcaactag >gi568815584f:103429979_103634818|GENSCAN_predicted_peptide_7|130_aa MPRTSAVTATQHRTLLTISRKQQSEQVQPMSQMEKPKGDFDAGPRLRSHTAPPVTAAGYS RAWHAGGIRKYPENKSPRTLLSSQEPMNERSCCSILTCIWCCLESKDHLVVIIRKAIQKQ NFLSEEFKSN >gi568815584f:103429979_103634818|GENSCAN_predicted_CDS_7|393_bp atgcccagaacctcggcagtcacggccacgcagcacagaacactgctcaccatctctcgt aagcagcagtcagagcaggtgcaacccatgtcacagatggagaagccgaaaggggacttt gatgcgggccctcggctcagatctcacactgccccacccgtcaccgctgcggggtacagc agggcctggcacgcaggaggtattcggaagtatccagagaataagtcaccaaggaccctg ctgtcctcccaggaaccaatgaacgagcgttcctgttgctccatcctcacctgcatttgg tgttgtttggagagcaaagatcacctggtggtgatcatcaggaaggccatccagaaacaa aacttcttatctgaggaattcaaaagtaattag >gi568815584f:103429979_103634818|GENSCAN_predicted_peptide_8|52_aa MWDNVDECQKQEAFTDRAAHKAADPQEPKGNSCLLVSSKKPLAKTDIYKHLQ >gi568815584f:103429979_103634818|GENSCAN_predicted_CDS_8|159_bp atgtgggacaatgtggatgaatgtcaaaaacaggaggcattcacagacagggctgcccac aaggctgcagatcctcaagaaccaaagggaaattcctgcctgcttgtctcttccaaaaag ccgctcgcaaagacagacatctacaaacatctacagtaa >gi568815584f:103429979_103634818|GENSCAN_predicted_peptide_9|120_aa MTSCRRAALWGGARRGAGGSNTETPFSAAGAAQAAERDWLGRLGCWCEEPRGCARRPRGQ RVGGRGCCGAVAPAPLAGDCCAVPHTAEAGSAHSRCSARAPGGAPGADSARERGPQEQGX >gi568815584f:103429979_103634818|GENSCAN_predicted_CDS_9|360_bp atgacgtcatgccggcgcgcggcattgtggggcggggcgaggcggggcgccggggggagc aacactgagacgccattttcggcggcgggagcggcgcaggcggccgagcgggactggctg ggtcggctgggctgctggtgcgaggagccgcggggctgtgctcggcggccaaggggacag cgcgtgggtggccgaggatgctgcggggcggtagctccggcgcccctagctggtgactgc tgcgccgtgcctcacacagccgaggcgggctcggcgcacagtcgctgctccgcgcgcgcg cccggcggcgctccaggtgctgacagcgcgagagagcgcggccctcaggagcaaggcgnn