GENSCAN 1.0 Date run: 18-Nov-116 Time: 17:35:05 Sequence gi568815593f:151686621_151904088 : 217468 bp : 42.54% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 2482 2571 90 2 0 107 77 28 0.627 2.65 1.02 Intr + 5757 5831 75 1 0 43 97 72 0.556 2.17 1.03 Intr + 9549 9637 89 1 2 75 61 52 0.866 -0.03 1.04 Term + 9783 10094 312 2 0 77 45 214 0.873 10.02 1.05 PlyA + 10687 10692 6 1.05 2.04 PlyA - 10831 10826 6 1.05 2.03 Term - 11055 10833 223 2 1 49 48 132 0.052 0.71 2.02 Intr - 16549 16473 77 1 2 108 82 62 0.150 4.99 2.01 Init - 24317 24249 69 2 0 83 36 53 0.339 0.70 2.00 Prom - 28835 28796 40 -5.25 3.02 PlyA - 28937 28932 6 1.05 3.01 Sngl - 36872 36501 372 2 0 70 50 175 0.698 7.77 3.00 Prom - 50763 50724 40 -6.85 4.03 PlyA - 51104 51099 6 1.05 4.02 Term - 59829 59705 125 1 2 117 43 115 0.943 7.47 4.01 Init - 65138 65084 55 2 1 90 105 59 0.840 9.30 4.00 Prom - 66726 66687 40 -5.05 5.03 PlyA - 67365 67360 6 1.05 5.02 Term - 72159 71843 317 1 2 95 39 206 0.206 10.52 5.01 Init - 72637 72532 106 1 1 75 27 82 0.463 1.23 5.00 Prom - 78760 78721 40 -4.45 6.04 PlyA - 79256 79251 6 1.05 6.03 Term - 79473 79264 210 0 0 36 35 281 0.996 13.91 6.02 Intr - 86402 86245 158 0 2 92 37 84 0.397 2.51 6.01 Init - 90421 90367 55 1 1 73 51 57 0.384 2.00 6.00 Prom - 92125 92086 40 -6.15 7.00 Prom + 93731 93770 40 -5.45 7.01 Init + 100007 100095 89 1 2 68 82 78 0.748 3.46 7.02 Intr + 103703 103784 82 2 1 83 80 109 0.999 8.32 7.03 Intr + 104269 104442 174 0 0 59 87 147 0.933 10.91 7.04 Intr + 107539 107629 91 0 1 82 110 57 0.998 5.95 7.05 Intr + 108859 108955 97 2 1 58 110 157 0.999 13.05 7.06 Intr + 110607 110808 202 0 1 26 55 366 0.982 25.47 7.07 Intr + 112592 112693 102 1 0 75 110 68 0.991 7.15 7.08 Intr + 113269 113380 112 0 1 29 71 51 0.972 -3.47 7.09 Intr + 113598 113726 129 1 0 81 56 60 0.775 1.85 7.10 Intr + 114140 114249 110 0 2 72 98 104 0.994 8.88 7.11 Term + 117265 117471 207 0 0 75 41 250 0.876 15.36 7.12 PlyA + 118026 118031 6 1.05 8.04 PlyA - 118335 118330 6 1.05 8.03 Term - 136343 136053 291 2 0 15 49 300 0.451 13.16 8.02 Intr - 142447 142301 147 1 0 110 102 110 0.919 14.11 8.01 Init - 144860 144648 213 2 0 62 43 108 0.875 2.49 8.00 Prom - 149782 149743 40 -7.05 9.00 Prom + 150867 150906 40 -3.75 9.01 Init + 162265 162409 145 0 1 100 53 101 0.515 8.13 9.02 Term + 163521 163954 434 1 2 32 37 301 0.275 13.87 9.03 PlyA + 164309 164314 6 1.05 10.07 PlyA - 164343 164338 6 -3.94 10.06 Term - 164456 164451 6 2 0 112 42 0 0.824 -5.31 10.05 Intr - 164984 164770 215 0 2 94 115 226 0.991 23.41 10.04 Intr - 168557 168420 138 0 0 77 96 127 0.967 11.91 10.03 Intr - 169763 169681 83 1 2 85 68 55 0.315 1.56 10.02 Intr - 173388 173165 224 0 2 89 109 175 0.471 15.70 10.01 Init - 179575 179516 60 1 0 84 72 22 0.186 1.51 10.00 Prom - 188974 188935 40 -7.45 11.00 Prom + 189329 189368 40 -4.95 11.01 Sngl + 191706 192167 462 2 0 34 37 295 0.888 14.91 11.02 PlyA + 195194 195199 6 1.05 12.08 PlyA - 198340 198335 6 1.05 12.07 Term - 201437 201016 422 0 2 46 43 163 0.531 2.07 12.06 Intr - 204376 204212 165 2 0 74 97 67 0.759 5.21 12.05 Intr - 205818 205691 128 2 2 118 75 79 0.990 9.00 12.04 Intr - 206115 205897 219 2 0 86 31 108 0.708 1.40 12.03 Intr - 209201 209084 118 0 1 60 31 139 0.295 4.00 12.02 Intr - 214709 214684 26 1 2 98 77 10 0.344 -2.35 12.01 Init - 215329 215271 59 1 2 71 99 32 0.465 3.44 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593f:151686621_151904088|GENSCAN_predicted_peptide_1|188_aa XTVHRLTLFYFLPITCHSQKPFVSPTRQKLETLSLALRPAYGPPDEPSLGAGLGPELRFL TNSLGECLFAPTPSFEEETEVQKGKVKDLYYWFHSPSAFFQGSVSITLTVQGSLLLWGGS EELGTRLAQSAPTPGSVILPLLVQVQGTLTRRHQFSSETPPTRLEIFSEGGEKPNNPDPT TWQLLHIP >gi568815593f:151686621_151904088|GENSCAN_predicted_CDS_1|567_bp ngcacagttcatcgccttacactgttttatttcctccccatcacctgtcattctcagaaa ccatttgtttccccaactagacagaagcttgagactctgtcactagctctgagaccagcc tatggaccacctgatgaaccatctctgggagctggactggggccagaacttcgtttccta acaaactccttgggggaatgtttatttgctccaacaccctcgtttgaagaagaaactgag gttcagaaaggtaaagttaaagatttgtactattggtttcacagcccttcagcctttttc cagggctctgtatccataactttgacggtgcagggctccctgctgctctggggagggtcg gaggagctcggcaccagattggctcagtctgccccaacccccgggtctgtaatattgccg ctgctggtccaggtccagggaacactcacacgcaggcatcaattttcttctgaaacccca cccacacgcctggaaatcttctctgaaggaggggaaaagcccaataacccagacccgaca acgtggcagctcttacacattccttga >gi568815593f:151686621_151904088|GENSCAN_predicted_peptide_2|122_aa MEYYSSITKNEILSFAATQMELKSPTDAPHWPIPDGNRGHGSSWVWAISFGVDIGTASCL PAQSDQTESFQQAGSSLPWLLLGSWNSPPSHRALESLRCSGSLDRSPPLSRVHFSSPGEP VG >gi568815593f:151686621_151904088|GENSCAN_predicted_CDS_2|369_bp atggagtactattcatctataacaaagaatgagatcctgtcatttgcagcaacacagatg gaactgaagtcgcctactgatgctccccactggccaatcccagatggaaacagaggccat gggagctcatgggtgtgggccataagcttcggggtggacatagggactgccagctgcctg cctgcccagagtgaccaaactgaaagcttccagcaagcaggctcgagtcttccgtggctt ctgcttggatcttggaactctcccccttcccacagagccctggagagcttgcgatgctct gggtctttggatagatcaccccctctctccagggttcatttttcttcacctggagagcca gtgggctga >gi568815593f:151686621_151904088|GENSCAN_predicted_peptide_3|123_aa MKDSPEIELGAAACWSEQSSKDGKESWGSPASLHFTPLISRLSITLGHSDLQGPLCVRLR AHSSYGGDALVQGSEEAYKDHFIQSASPLCHSVSDSIFADVFVLKCKAIWEEPLSESLVA SFP >gi568815593f:151686621_151904088|GENSCAN_predicted_CDS_3|372_bp atgaaggattctccagaaattgagttaggtgcagccgcatgctggtcagaacagtctagc aaagatggcaaagagagctggggctcgcctgcatctctgcatttcacacctctgatctcg agactcagcatcacccttggccacagtgatctccaagggcctctgtgtgtcaggctcagg gcacacagtagctatgggggagatgccctggtgcagggttctgaggaggcctacaaagat cacttcatccaatcggcatccccactgtgtcactctgtgtctgactccatctttgctgat gtgtttgtgttaaaatgcaaagccatctgggaagaaccattgtcagaatccttggtagca tccttcccctga >gi568815593f:151686621_151904088|GENSCAN_predicted_peptide_4|59_aa MTCGGCAEAVSRVLNKLGGVKYDIDLPNKKVCIESEHSMDTLLATLKKTGKTVSYLGLE >gi568815593f:151686621_151904088|GENSCAN_predicted_CDS_4|180_bp atgacctgtggaggctgtgctgaagctgtctctcgggtcctcaataagcttggaggagtt aagtatgacattgacctgcccaacaagaaggtctgcattgaatctgagcacagcatggac actctgcttgcaaccctgaagaaaacaggaaagactgtttcctaccttggccttgagtag >gi568815593f:151686621_151904088|GENSCAN_predicted_peptide_5|140_aa MDMNCVPANSYVDTLTPNATVFRDKACNRAIKANRGRVTTGPARRKEAARPAGIRTGGGG ASLAAGPGVGGASGTRPLVRTLGSAAEPALRIQRGAADTAATPPPHRRCLSHAGEWLRRP RHADLQSRVASRLHSRLTGC >gi568815593f:151686621_151904088|GENSCAN_predicted_CDS_5|423_bp atggacatgaattgtgtccccgcaaattcctatgttgataccctaacccccaatgcgact gtatttagagacaaagcctgtaacagggcgataaaggctaaccgaggccgggtgaccacg ggcccggcccgcagaaaggaggcggcgcggccggcagggatcaggactggagggggcgga gcctccctcgcggccgggcccggagtgggaggggcctccgggaccaggcctctagtgcgc actctcggaagcgcagccgaacccgccctccgaatccagagaggcgctgctgacaccgcc gccacaccgccgccacaccgccgctgcctcagtcatgccggtgagtggttgcgccgtccc cgccacgcagacttgcagtcccgggttgcttctcggctgcacagccgcttgaccggatgc tga >gi568815593f:151686621_151904088|GENSCAN_predicted_peptide_6|140_aa MVDSGPETGKMQDEPEIPEATVAAHLPRACTSTRRKEAQTLFLCHIPTSNPCLPGSPELA QVQMPLQPSRQALANVNIGSLICNVGAGGPALAAGAAPAGGPAPSIAAASAEEKKMEAKK EESEESDDDMGFGLFTKPVL >gi568815593f:151686621_151904088|GENSCAN_predicted_CDS_6|423_bp atggttgattctggacctgagacaggcaaaatgcaagatgagcctgaaatacctgaagct acggtggcggcacatctacccagggcctgtacaagtacacgtagaaaggaagcccaaacc cttttcctttgtcacattcccacttccaacccttgtctccctggaagcccagaactagcc caagtacaaatgccgctgcagccttctcggcaggccctggccaacgtcaacattggaagc ctcatctgcaatgtaggggctggtggacctgctctagcagctggtgctgcaccagcagga ggtcctgccccctccattgctgctgcttcagctgaggagaagaaaatggaagcaaagaaa gaagaatctgaggagtctgatgatgacatgggctttggtctttttactaaacctgtttta taa >gi568815593f:151686621_151904088|GENSCAN_predicted_peptide_7|464_aa MEKPSPLLVGREFVRQYYTLLNQAPDMLHRFYGKNSSYVHGGLDSNGKPADAVYGQKEIH RKVMSQNFTNCHTKIRHVDAHATLNDGVVVQVMGLLSNNNQALRRFMQTFVLAPEGSVAN KFYVHNDIFRYQDEVFGGFVTEPQEESEEEVEEPEERQQTPEVVPDDSGTFYDQAVVSND MEEHLEEPVAEPEPDPEPEPEQEPVSEIQEEKPEPVLEETAPEDAQKSSSPAPADIAQTV QEDLRTFSWASVTSKNLPPSGAVPVTGIPPHVVKVPASQPRPESKPESQIPPQRPQRDQR VREQRINIPPQRGPRPIREAGEQGDIEPRRMVRHPDSHQLFIGNLPHEVDKSELKDFFQS YGNVVELRINSGGKLPNFGFVVFDDSEPVQKVLSNRPIMFRGEVRLNVEEKKTRAAREGD RRDNRLRGPGGPRGGLGGGMRGPPRGGMVQKPGFGVGRGLAPRQ >gi568815593f:151686621_151904088|GENSCAN_predicted_CDS_7|1395_bp atggagaagcctagtcccctgctggtcgggcgggaatttgtgagacagtattacacactg ctgaaccaggccccagacatgctgcatagattttatggaaagaactcttcttatgtccat gggggattggattcaaatggaaagccagcagatgcagtctacggacagaaagaaatccac aggaaagtgatgtcacaaaacttcaccaactgccacaccaagattcgccatgttgatgct catgccacgctaaatgatggtgtggtagtccaggtgatggggcttctctctaacaacaac caggctttgaggagattcatgcaaacgtttgtccttgctcctgaggggtctgttgcaaat aaattctatgttcacaatgatatcttcagataccaagatgaggtctttggtgggtttgtc actgagcctcaggaggagtctgaagaagaagtagaggaacctgaagaaagacagcaaaca cctgaggtggtacctgatgattctggaactttctatgatcaggcagttgtcagtaatgac atggaagaacatttagaggagcctgttgctgaaccagagcctgatcctgaaccagaacca gaacaagaacctgtatctgaaatccaagaggaaaagcctgagccagtattagaagaaact gcccctgaggatgctcagaagagttcttctccagcacctgcagacatagctcagacagta caggaagacttgaggacattttcttgggcatctgtgaccagtaagaatcttccacccagt ggagctgttccagttactgggataccacctcatgttgttaaagtaccagcttcacagccc cgtccagagtctaagcctgaatctcagattccaccacaaagacctcagcgggatcaaaga gtgcgagaacaacgaataaatattcctccccaaaggggacccagaccaatccgtgaggct ggtgagcaaggtgacattgaaccccgaagaatggtgagacaccctgacagtcaccaactc ttcattggcaacctgcctcatgaagtggacaaatcagagcttaaagatttctttcaaagt tatggaaacgtggtggagttgcgcattaacagtggtgggaaattacccaattttggtttt gttgtgtttgatgattctgagcctgttcagaaagtccttagcaacaggcccatcatgttc agaggtgaggtccgtctgaatgtcgaagagaagaagactcgagctgccagggaaggcgac cgacgagataatcgccttcggggacctggaggccctcgaggtgggctgggtggtggaatg agaggccctccccgtggaggcatggtgcagaaaccaggatttggagtgggaagggggctt gcgccacggcagtga >gi568815593f:151686621_151904088|GENSCAN_predicted_peptide_8|216_aa MVDTPPPTKLKCSRSTSDCCAGSENFKPVDLSLLGSVGMGSAELNHLAPWLQPPFQGSEW FCLAGVPGATGVSYVKAIDIWMAVCLLFVFSALLEYAAVNFVSRQHKELLRFRRKRRHHK EDEAGEGRFNFSAYGMGPACLQAKDGISVKGANNSNTTNPPPAPSKSPEEMRKLFIQRAK KIDKISRIGFPMAFLIFNMFYWIIYKIVRREDVHNQ >gi568815593f:151686621_151904088|GENSCAN_predicted_CDS_8|651_bp atggtggacactcctccccccaccaagctcaagtgttccaggtcaacttcagactgctgt gctggcagcgagaatttcaagccagtggatcttagcttgctgggctccgtggggatggga tctgctgagctaaatcacttggctccctggcttcagccccctttccaggggagtgaatgg ttctgtcttgctggggttccaggtgccactggggtgtcctatgtgaaagccattgacatt tggatggcagtttgcctgctctttgtgttctcagccctattagaatatgctgccgttaac tttgtgtctcggcaacataaggagctgctccgattcaggaggaagcggagacatcacaag gaggatgaagctggagaaggccgctttaacttctctgcctatgggatgggcccagcctgt ctacaggccaaggatggcatctcagtcaagggcgccaacaacagtaacaccaccaacccc cctcctgcaccatctaagtccccagaggagatgcgaaaactcttcatccagagggccaag aagatcgacaaaatatcccgcattggcttccccatggccttcctcattttcaacatgttc tactggatcatctacaagattgtccgtagagaggacgtccacaaccagtga >gi568815593f:151686621_151904088|GENSCAN_predicted_peptide_9|192_aa MTASAHSHLNKGIKQVYMSLSQGEKVQAMYVWIDGTGEGWRCKTQTLDRPQGPYYCSVGA GKAHGRDIVEAHYQACLYAGAKMVGTNAEVMPAQWEFQTGPCERISMGDHFWVGCFILHH VCEDFGVIATFDPKPIPGNRKCSGCHTNFSTKAMWEENGLKYTEEAIEKLSSGTSTTSTL MIPREAWTKPNT >gi568815593f:151686621_151904088|GENSCAN_predicted_CDS_9|579_bp atgaccgcctcagcacattcccacttaaacaaaggtattaagcaggtgtacatgtccctg tctcagggtgagaaagtccaggccatgtatgtctggatcgatggtactggagaaggatgg cgctgcaagacccagactctggacagaccccagggtccatattactgcagtgtgggagca ggcaaagcccatggcagggacatcgtggaggcccattaccaggcctgcttgtatgctgga gccaagatggtgggtactaatgctgaggtcatgcctgcccagtgggaatttcaaaccgga ccctgtgaaagaatcagcatgggagatcatttctgggtgggctgtttcattttgcatcat gtatgtgaagactttggagtgatagcaacctttgatcccaagcccattcctgggaacagg aagtgttcaggctgccataccaacttcagcaccaaggccatgtgggaggagaatggtctg aagtacactgaggaggccatcgagaaactaagcagtggcaccagtaccacatccacgctt atgatcccaagggaggcctggacaaagcccaacacctag >gi568815593f:151686621_151904088|GENSCAN_predicted_peptide_10|241_aa MGELMLNTVESELWKSLLLQDYRVNIFLRQQWNDPRLAYNEYPDDSLDLDPSMLDSIWKP DLFFANEKGAHFHEITTDNKLLRISRNGNVLYSIRITLTLACPMDLKNFPMDVQTCIMQL ESFGYTMNDLIFEWQEQGAVQVADGLTLPQFILKEEKDLRYCTKHYNTGKFTCIEARFHL ERQMGYYLIQMYIPSLLIVILSWISFWINMDAAPARVGLGITTVLTMTTQSSGSRASLPK G >gi568815593f:151686621_151904088|GENSCAN_predicted_CDS_10|726_bp atgggtgagttgatgctgaacactgttgaatcagagctctggaaatccctcctgcttcag gactatagggtcaacatcttcctgcggcagcaatggaacgacccccgcctggcctataat gaataccctgacgactctctggacctggacccatccatgctggactccatctggaaacct gacctgttctttgccaacgagaagggggcccacttccatgagatcaccacagacaacaaa ttgctaaggatctcccggaatgggaatgtcctctacagcatcagaatcaccctgacactg gcctgccccatggacttgaagaatttccccatggatgtccagacatgtatcatgcaactg gaaagctttggatatacgatgaatgacctcatctttgagtggcaggaacagggagccgtg caggtagcagatggactaactctgccccagtttatcttgaaggaagagaaggacttgaga tactgcaccaagcactacaacacaggtaaattcacctgcattgaggcccggttccacctg gagcggcagatgggttactacctgattcagatgtatattcccagcctgctcattgtcatc ctctcatggatctccttctggatcaacatggatgctgcacctgctcgtgtgggcctaggc atcaccactgtgctcaccatgaccacccagagctccggctctcgagcatctctgcccaag ggctaa >gi568815593f:151686621_151904088|GENSCAN_predicted_peptide_11|153_aa MWLLLKALGFIREAEHKNLENLQPDNVLEKKNPFSEEKLKPAEEICISNEEPNVNPQDNG ENVSGACQRSSQQPLPSQAQRLWRKSFRGPSPGSLCCVRSRDLVPCVPATPAMTKRCQST IQVVSSEGGSPKPWQLPRGIEPAGAQKSRIEVW >gi568815593f:151686621_151904088|GENSCAN_predicted_CDS_11|462_bp atgtggttgctgttaaaggcactcggttttataagggaagcagagcataaaaatttggaa aatttgcagcctgacaatgtgctagaaaagaaaaacccattttctgaggagaaattgaag ccagctgaagaaatttgcataagtaatgaggagccaaatgttaatccccaagacaatggg gaaaatgtctccggagcatgtcagaggtcttcacagcagccactcccatcacaggcccaa aggctttggagaaaatcgtttcgtgggccaagcccagggtccctgtgctgtgtgcggtct agagacttggtgccctgtgtcccagccactccagccatgactaaaaggtgccaaagtaca attcaggttgtttcttcagaaggtggaagccccaagccttggcagcttccacgtggcatt gagcctgcgggtgcacagaagtcaagaattgaagtttggtaa >gi568815593f:151686621_151904088|GENSCAN_predicted_peptide_12|378_aa MGFRDQLSHGATTYKLHNLRTRLQNNTEEPHSSHESSAEWTVAPTVGHGKHLGGFSSHVG SMVQLKALDTQLPSVITQNPRDEALKMSVTSQHQSDTWLPARIPCRLIFGLGKQFNLGLT SLRTHITHITPAKNTLVVLQSLAASKEAEAARSAPKPMSPSDFLDKLMGRTSGYDARIRP NFKECLLRTALQIENVLRDAPRSLFGPTLPIYLEAPSLPLHLVPEAIASSIVNITPRIGT RILPCLPQYLIGIKKVDLKSKSQIANYNLYVKGCDHLSKSYSPSKSSPGHHLSEVFRALF SRTSLTWTPAAFTTLTHGLAFLFGYFMSALFVSLFGNPVLKILDSATVCVIVSMPLPSSK PVSYRIAIKMSTLGVRQI >gi568815593f:151686621_151904088|GENSCAN_predicted_CDS_12|1137_bp atgggcttcagagaccagttgtctcatggggctaccacttacaaactgcataacctcagg accagactgcagaataacactgaagagccacacagcagccatgagtccagtgctgaatgg acagtggccccaactgtgggccatgggaagcacctgggagggttttccagccatgtgggc tccatggtgcagctgaaagcattggacacccagctgccaagtgtgattactcaaaatccc agagatgaagccttaaaaatgtccgtgacttctcagcatcaatcagacacttggctgcct gccaggatcccatgccggctgatatttgggctgggtaaacaatttaacctgggccttacc tcattaaggacccatattacccatattactccagctaagaacactcttgtggttctacag agccttgctgcttctaaggaggctgaagctgctcgctccgcacccaagcctatgtcaccc tcggatttcctggataagctaatggggagaacctccggatatgatgccaggatcaggccc aattttaaagagtgcttgctccgcacagctctgcaaatagaaaatgttctgagagatgca cccaggtctctgtttgggcccactctccccatttaccttgaggctccttcactccccctg caccttgttcctgaggccatagcttccagcattgttaacataacccccaggataggaaca agaattctgccctgtctacctcagtacctcataggaatcaagaaagtggacctgaaaagc aaatcacaaattgcaaattacaatctttatgtgaagggctgtgaccatttatccaagtcc tattcaccttcaaagtcatctccaggccaccacctttctgaggtctttcgtgccttattc tccagaacttctctcacctggactcctgctgcttttaccactcttacccatggtcttgca tttctctttggttattttatgtcagccctttttgtcagcttatttggaaacccagtgctg aaaattctggactctgccactgtctgtgtgatcgtgagcatgccccttccttcctctaaa cctgtttcctacagaatagccattaagatgagcacccttggagttagacaaatctga